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THE PERFECT LEARNER 


ANATOL RAPOPORT 
UNIVERSITY OF MICHIGAN 


As a ‘‘base line’? of memorization performance, the behavior of a 
‘*perfect learner’’ is considered. He is characterized by a perfect mem- 
ory and by the ability to choose the best search procedure in problems 
where the correct response from a given repertoire is to be found to each 
of several stimuli under the condition of ‘‘right’? and ‘*wrong’’ prompt- 
ings by the experimenter. 

Expected learning curves are derived for the case of disjoint response 
repertoires associated with the stimuli under cyclic and random presen- 
tation of the stimuli and for the case of a single response repertoire (a 
one-to-one matching problem) under cyclic presentation. 


The perfect learner, like the Bourbons, never forgets anything. 
Unlike the Bourbons, he never fails to learn from his mistakes. 
Specifically, he never makes the same mistake twice. Further- 
more, when he makes a guess, he makes the shrewdest guess, 
maximizing the probability that it is correct. In other respects, he 
is quite pedestrian. His guesses are shrewd by virtue of being 
calculated, not intuited. He forms no concepts and gets no in- 
sights, and he has never experienced the ‘‘Aha!’’ phenomenon. He 
learns by an efficient but unimaginative trial and error routine. 
For this reason, he obeys a perfectly straightforward mathematical 
learning model without free parameters. 

What parameters there are in the model are entirely situational, 
determined by how much is to be learned, how much information is 
fed back to the learner, and the schedule of stimulus presenta- 
tions. With regard to the information fed back, the problem be- 
comes trivial if the learner is corrected by being always given the 
correct response after making a trial response to a given stimulus, 
because he will always thereafter respond correctly to that stimu- 
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lus. We shall assume the next simplest case, namely, where the 
learner is told only whether he is right or wrong. 

The ‘‘amount to be learned’? will depend on the number of dif- 
ferent stimuli to which correct responses must be learned and on 
the range of possible responses (the repertoire) from which the 
correct ones are to be selected. The schedule refers to the order 
of stimulus presentations. 

We shall consider two kinds of response repertoires and two 
kinds of schedules. Of the four associated models, we shall con- 
sider three, namely, 


Case 1. Disjoint repertoires; cyclic schedule. To each of M 
stimuli s; (i = 1, 2, ...M) there corresponds a distinct response of 
N possible responses, ri; (j = 1,2,...N). Upon being presented 
with s;, the learner knows that the corresponding response is 
among the N responses r;;.. The stimuli are presented cyclically: 
Sig S25 20eSMs S15 SQ oeee 

Case 2. Disjoint repertoires; random schedule. The learning 
task is the same as in Case 1, except that the stimulus presented 
each time is randomly selected (with replacement) from the set of 
stimuli. 

Case 38. Common repertoire; cyclic schedule. Here the same 
repertoire serves for all the stimuli. We shall consider the case 
where there is a one-to-one correspondence between the stimuli 
and the responses, hence M = N, so that every response is correct 
for exactly one stimulus. 

We will not consider the case of common repertoire and random 
schedule in this paper. 


Case 1. We seek P,(¢), the probability that a wrong response 
will be given on the ¢th presentation. Let ¢ = gM +i (q, ¢ integers; 
qg20;0<2¢SM). Since the probability of a wrong response to a 
given stimulus depends only on the number of times the stimulus 
had been presented and not on the response given to any other 
stimulus, we have 


Pi(qM +4+1)=Py(qM +2) fori<M. (1) 

Moreover, because of perfect memory and cyclic schedule, a 
wrong response can be given to a stimulus on the (gM + i)th pres- 
entation only if a wrong response had been given at the [(g ~ 1)M + 
é]th presentation (¢ > 0). Since the stimulus presented on the t;th 
presentation is the same as that on the ¢;th presentation for all 
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¢; =ti mod (M), the probability of a wrong response to a given 
stimulus on the (¢ + gM)th trial (q = 0, 1, 2, ...) becomes succes- 
sively (N-1)/N, (N -2)/(N —1), etc., as long as the correct 
response is not given. Hence the probability of wrong response is 
given by 


Po(qM + i) = Pol(g - tym +a Vo 9 * for0<qSN-1; 
Berd (2) 
=Qforg>N- 1. 


Combining (1) and (2) and resubstituting ¢ for gM + 7, we obtain 
N= 1 a jie 
baer aL “lah (3) 


where [¢/M] is the greatest integer less than ¢/M. 

Equation (3) represents a step-function of ¢. The expected 
cumulated-error curve will be a broken line with breaks in the 
slope occurring at ¢= gM (q¢<WN). The expected total number of 
errors will be M (N — 1)/2. 


Case 2. The probability that the stimulus chosen on the (¢ + 
1)th presentation had been chosen j times in the ¢ preceding 


presentations (0 Sj <¢) is given by a? (1 -—2)*"i, where z = 
M7}, 

Given that there have been j previous presentations, the proba- 
bility that the correct response had not been given is 


N-1 lr " N-j _N-j 


Ne ae et ANB et, NO 
Therefore, the probability that the correct response to a stimu- 
lus chosen j times had not been guessed and will not be guessed 
on the (¢ + 1)th presentation is 
: t\ ; _.N-j-1 
Py(t + 1,7) = (jer oe) Me 


=0 forj2N—1. 


forj7 <N-1; 
(4) 


The probability of wrong response on the (¢ + 1)th presentation 
is then 


Min(t,N-1 N-7-1 
aN & es ii 
Paced) os (jJera yi, (5) 


j= 
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or 
Min(t-1,N—71) ° 
ab eth pa irate Canes Mee » 
j=0 
If ¢ SN — 1, we can write (5) as 


N Soto /(t\ a eee =. /6\\ St = Nene 
H tf (ounnr'=B Se (e-or tM 
j=0 


j=0 


Equation (7) is evidently a ‘‘smoothed out’’ version of (3). 


t > N — 1, the more general formula (5) or (6) applies. 
The expected number of cumulated errors is given by 


w(¢) = e Pant bl 


In particular, the expected total number of errors will be 


co §6Min(t—-1,N—1) 2 
y (5) aa 


w (c) 


? 
tea | j=0 N 
Min(t,N-1) co ; . 
Pe: Seiya 
j Wiss 
J=0 t=0 


where y = 1 —z, and x = 1 by convention. 


To calculate w (co), we first write (9) as 
v(t taitey Se (fey Maat 
t=O j= t=N j=0 

The first double summation reduces to 


ye [Sete ene 
os N MN 2M ‘ 


t=0 


(6) 


(7) 


For 


(8) 


(10) 


(11) 


To calculate the second double summation of (10), we note that 


it can be written as 


N= 
y a) |e y! ] N-k-1 
La ay lil 1ny N 


(12) 
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and separated into two summations: 


a Ol yN | ke taky fat ys as 
£4 ay* ki t-y} £4 N ayFlkY 1-yf 


Let us call the first summation of (13) F (2, y). Then the sec- 


ond can be written asd [Er (x, y)|. Define 
r 
Pave es ot 1: 
=a i), 
ak) 
Q@, y= )°= 
(z, y) = La oP = 


Then, using Taylor’s Theorem for polynomials, F(z, y) can be 
written as 


(14) 


AONa 2) 


AO ed arrears 1D! ayNat 


P (y). (15) 


The last term of (15) reduces to aY~!, Since (18) can be written 
as 


F(@e,y)- = Ere, | (16) 


we can now write 
=-P )+Q(a eae —"P (ety) += Q(e ce 
=~-P(r+y 3Y Po ha N ’ N 


--P (e+y) + Q (yy) + 2871 + =P ery) (17) 
7) 2 
+= Pety)- FON) 5g COV) 


We now use z + y = 1 to calculate 


. eo _(N- 1) - 2), 
P(@+y)=N; QW = T=; ay 2 (29) (1 -y) ) 
PR ya ee 


2 


326 ANATOL RAPOPORT 


Substituting into (17), we obtain 


oo =N-1 : 
t ; ~;N—-1-7 N-1 (1 - y)(N - 1) 
os = ey ae tees eet Fe eee a spel 
(ety N N + 5 + i+ 9 


ae (18) 
N-1 (N-1)(N -2) 
N(l-y)  2(1-y)N ’ 


which reduces to 


(NAD) tN Aha 


19 
2(1-y) 2M Se? 
Combining (19) and (11), we finally obtain 
w (0) = ON, (20) 


This result can, of course, be obtained directly by noting that the 

expected number of errors to each stimulus is $(N — 1). The same 

result obtained by calculating Lim w(t) is a check on the correct- 
too 


ness of our formula for w(¢). 


Case 3. In this case, the learner must learn to match each of 
the N responses 7; with some stimulus s;. The completion of this 
task is equivalent to determining one of the N! permutations of the 
N responses fj", fj,5 igs «+ rin such that the response rj, is cor- 
rect for the stimulus s;. To make the best guess at each stimulus 
presentation, it is sufficient to adopt the following strategy. Ar- 
range the N responses in some arbitrary order rj, ro, ... ry; keep 
giving the response r; to stimuli s;, sg,...until it matches some 
sje Therefore, in the correct permutation of responses, r; will oc- 
cupy the position fiz. After r; has been so placed, continue with 
m. until it has been matched, etc., except, of course, that the 
proper responses will always be given to the stimuli already 
matched. 

With this procedure, eventually the correct response to every 
stimulus will be made, and this must occur not later than the Nth 
cycle. 

The calculation of the learning curve involves some results of 
combinatorial analysis, which we shall present in the course of 
our discussion. 
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Let us associate with each ordered partition of the integer N, 
namely w th II = (mi, na, ... nz), where nj > 0, Sn; = N, a set of 
permutat’ s {P} of the N responses in the following manner. We 
take the first n; responses of the correct permutation, P* = (ri; 
res «a1 riy)s and, without disturbing their relative order, place 
them into some nj; of N ordered slots. Then, we take the next no 


responses of the correct permutation and place them, again pre- 
serving their order among themselves, but not necessarily in rela- 


tion to the first n,, into some ne of the remaining N — 7, slots. 
We continue until the last block of n, responses are placed (with 
order preserved) into the remaining nz slots. This operation will 
associate O (II) distinct permutations with each partition II, where 


N! 
Ni!no!... nz! 


0 (Ml) = (21) 

In each of the permutations of this set, at least n, of the re- 
sponses r;", rj, ... will lie in the same order as in P*; of the per- 
mutations Sere Tn,+2, +++ at least no will lie in the same order, 
etc. We are, however, interested in the subset {P’} of this set of 
permutations, which preserve the order of exactly ni, no, ... nk of 
the responses in the correct permutation P*. Of these permuta- 
tions there will be o(II). We seek the relation between o (II) and 
O (Il). 

Consider the ordered partitions I1* of N with exactly & ele- 


: 2e N- 
ments, i.e., the partitions (1, m2, ... mk). There are |, _ 4 


such partitions. 

Definition 1. The pernicn I1*~* is an immediate contraction of 
IF = (mi, nay «++ me) if PW! = (mi, may vee Me + Meaty Megas e+e ME) 
for some t, 1<t¢<k. Evidently every Il* has & -1 immediate 
contractions. 

Definition 2. The partition II_*~* is a contraction of eet 
there exist partitions I]: Patel Bs . yell; feySuch that: Il. ko is 
an immediate Seton of I; i pl ae te I, Seca phende LL. : ior 
Heaetas i.e., there exists a ‘path” over Suimadsats donncuens 
from II * to II, k-~s_ Evidently the relation ‘‘contraction of*’ is 
transitive, and ie shall write I *-* cCI*. We have immediately: 
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Lemma 1. Let {P;} and {Pn} be sets of permutations associated 
respectively with partitions Ib and NES ee: in the manner described 


above. Then {P,,$ c {P;} if and only if yates CG Th 
Theorem 1. A partition Ni: has i a ‘ contractions among the 


partitions Il*-s, 

Proof. Consider the lattice EL of the partitions of N. All the 
partitions IT* are on the Ath level, and the bonds of the lattice 
connect each partition with its immediate contractions. An ex- 
ample is shown in Figure 1. 

Consider the sub-lattice L’ of L consisting of the partition and 
all of its contractions, such as is shown in Figure 2. Note that 
the total number of contractions of a partition depends only on the 
total number of elements in the partition and not on the magnitudes 
of the elements. Therefore L’ is isomorphic to the entire lattice 
of the partitions of the integer &. In that lattice, the (& — s)th 
level, which coincides with the (&-s)th level of L, has 
( ee : ) = ( : ' partitions. This proves the theorem. 

Now imagine that the partitions, which are the ‘‘nodes’’ of L, 
are ‘‘neurons.’’ Upon being stimulated, they transmit (instantane- 


1,1, 1,121,121 


FIGURE 1. The lattice Z of all the partitions of VN = 5. 
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FIGURE 2. (a) The sublattice L” of all the contractions of [J 4 = Oe 
iG 1; 1) of 5. (b) The latice of all the partitions of NW = 4, isomorphic 
tol. 


ously) impulses along the bonds to other ‘‘neurons,”’ i.e., to their 
contractions. Any number of impulses received by a neuron simul- 
taneously (through all the bonds converging on the neuron) are 
counted as a single impulse. Suppose now we stimulate each of 
the neurons on the kth level of the lattice once in succession. We 
wish to know how many impulses will be received by each neuron 
on the (& — s)th level. 

Theorem 2. /f each of the ‘‘neurons”’ on the kth level in the lat- 
tice L is stimulated once in succession, each of the ‘‘neurons’’ on 


the (k — s)th level will have received (" co i ) impulses. 

Proof. By Theorem 1, when each neuron on the fth level is 
stimulated, (‘ E ; neurons on the (& — s)th level receive an im- 
pulse. Being all received by distinct neurons, they are counted as 
distinct. Moreover, there are (; ¥ i) neurons on the sth level. 
Since they are stimulated in succession, the resulting volleys all 
contribute distinct impulses. Therefore a total of e i i: (f y ) 


distinct stimuli will have been received on the (k — s)th level. 
Since the structure of the lattice does not depend on the ni, each 
neuron of the (& — s)th level will have received the same number of 


impulses. There are es ia :) neurons on that level. Therefore 
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the number of distinct impulses received by each neuron on the 
(k — s)th level is 


(37) (ane a 
(hoa) ree 


We seek o[II (ni, no, ... nz)], i.e., the number of permutations 
P’ associated with II. By Lemma 1 and Theorem 1, we have 


ales) 
emit Ss 
O (UL *) = ie Sec Ie ears (23) 


s=0 j,=1 


where j. ranges over all the contractions of * at the (& ~ s)th 
level. We shall first prove: 
Theorem 3. 


(Eo a een (vrs Cay) 


a 0 (*) = = ae ye o(IF~*). (24) 


ix 1 sS=0 J=1 


Proof. The left side of (24) represents the result of counting 
the permutations associated with the partitions II*, counting the 
repetitions of each permutation as distinct permutations. This 
process is isomorphic to counting the impulses generated at all 
the levels of order k, k —-1,...N —k+ 1, when the ‘‘neurons’’ on 
the kth level have been stimulated in succession, since each set 
of permutations {P‘*)} includes all the sets {P‘*-5)}. Now the 
sets of permutations {P’} are all disjoint, since each permutation 


P’ uniquely determines the partition with which it is associated. 
Therefore to obtain O(II nO from a (LFS), we must weight the con- 


tribution from the (& — s)th level by the number of times the corre- 
sponding permutations have been repeated. By Theorem 2, the 
weighting factors are given by equation (22), which are equivalent 
to those of (24) by the symmetry of binomial coefficients. 

Theorem 4, 


yar OF) 


o@¥)= >) Dd. (1) Ody). (25) 


j=0 i=1 
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Proof. By Theorem 3, we can write the right side of (25) in the 
following form: 


o (11%) + (3) >a aL) 3)x atleast) a4 


i 


- (a) >, oa) - Alps oN a3)=.4 28 (38) 


ee Sees eS ee Ore SRS eS ie) eB 6. B 8 41m 6 Ge 0) 0) © a 0.0 © 0) 0) © 0 6 @ 0 46. 6 


Adding by columns, we observe that all the terms vanish except 
the first, which proves the theorem. 

We again invoke the invariance of the lattice structure to state 
the Corollary. 


: ad 
ca J 
BU  eG) eye. .0(11;.*7)). (27) 


Equation (27) enables us to determine the number of permuta- 
tions P’ associated with each partition II* of N, since O(Il) is 
given by (21). 

Let us return to our learner. He is using an arbitrary permuta- 
tion of N responses (ri, ro, ... fy) aS a ‘‘strategy permutation’’ 
according to the procedure described above. As eventually r, 
matches the correct stimulus, he will place it in the corresponding 
position, preserving the order of the remaining responses. This 
insures the most efficient ‘‘search and fix’’ procedure. 

The number of matchings the learner will have made in the first 
cycle will be equal to the exact number of responses in his Strat- 
egy permutation which are in the same order as the corresponding 
responses in the correct permutation P*. Let these responses be 
iy 72, +++ Tn, Thereafter these responses will always be given 
correctly. The additional responses, to be guessed correctly in 
the second cycle, will be exactly those no responses, fn,+1, «+ 
?n,+n,» Which are in the same order as the corresponding responses 
in P*, etc. 

Therefore the number of ‘‘strategy permutations’’ which will re- 
sult in guessing exactly n; responses in the first cycle, n2 in the 
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second, ... nm; in the kth is o[I](n1, na, ... nx)], and the proba- 
bility of this event is given by 


I (ni, may «ee me) =< OLI (1, nay «++ nay]. (28) 


Given the event corresponding to II (n1, no, ... 2%), the learner 
will have given n, correct responses on the first cycle, n1 + ng on 
the second, ... N on the &th, and N on every succeeding cycle, 
because once a correct response is given to a stimulus, it will 
always be given to the subsequent presentations of that stimulus. 

Therefore the expected number of correct responses given to the 
qth cycle will be 


E [e(g)] = 2M (m1, na, ... mk)e(q, ), (29) 


where 
q 
0(g,k) = )" mi, if g Sk, 
a 


aN ifeg. Sk, 


and the II (n1, no, ... ne) range over all the partitions of N. The 
average probability of wrong response to the stimulus presenta- 
tions in the gth cycle will be 


Pu(q) = 1-51 (80) 


As we have seen, the computation of this probability involves 
the following steps. 


1. The listing of the partitions of N. 

2. Computation of O(Il) = N!/(ni! ng! ... ng!), corresponding 
to each partition (n1, no, ... nx) of N. 

3. Computation of o (I) from (27) and (28). 

4. Computation of EF [c(q)] from (29). 

The computation of P,,(¢) for each ¢ is more difficult and has not 
been undertaken in this paper. 

Note that the method yields not only the expected number of 
errors which the perfect learner will make in each cycle but also 
the probability distribution for the possible number of errors. An 
example of such a distribution calculated from the II (nis Na, oe 
n,) is shown in Table 1 for N = 5. 
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TABLE 1 


Errors* 


Cycle 


*The entries are probabilities (x 120) of the total number of errors 
made by the perfect learner on successive cycles in a search and fix 
task with a single repertoire of stimuli cyclically presented. 


We see from the table that the expected total number of errors is 
5. This number can also be obtained directly as in the preceding 
case by noting that the respective total numbers of errors to the 
stimuli in the order in which the correct responses are guessed 
will be (N — 1)/2, (N — 2)/2,... 1/2. Therefore the expected num- 
ber of errors will be 


Wat is 


j=l 


The expected cumulated numbers of errors in each cycle are 
given in Table 2 compared with the expected cumulated numbers 
of errors of a learner who gains a constant amount of information 
per error (cf. Rapoport, 1960) but has the same total number of 
cumulated errors as the Perfect Learner. We note that the Perfect 
Learner makes more errors in the beginning of the process and 
later catches up with the Constant Gain (per error) Learner. 

William Livant (thesis unpublished at this writing) has compared 
some 300 empirical learning curves with curves determined by 
Rapoport’s equation (1960), which is closely analogous to H. D. 
Landahl’s equation (1941). In all cases without exception, the 
experimental points either fitted the curves almost exactly or lay 
above the theoretical curves. Since the parameter k was always 


TABLE 2 


4.70 


Constant Gain Learner 


Perfect Learner 
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fixed by the asymptotic value of w (cumulated number of errors), 
the discrepancy between data and theory (if it appeared) always 
appeared as a “bulge.’’? Characteristically this bulge in learning 
curves averaged for groups of individuals appeared more frequently 
in tasks with smaller total uncertainty. Typically these were also 
the more complex tasks: the smaller uncertainty was due not so 
much to smaller repertoires of stimuli or responses as to greater 
interdependence (redundancy) among the stimulus-response as- 
sociations. Accordingly the explanation for the ‘‘bulges’’ was 
usually given in terms of the discrepancy between the objective 
(a priori calculated) total uncertainty of the task and the uncer- 
tainty perceived by the subjects. The latter would naturally be 
expected to be greater, since the subject would be expected not 
to utilize the redundancies in the task completely. It was also 
shown mathematically that if H were taken as greater (and & cor- 
respondingly, since H/k =W, the fixed asymptotic value of w), the 
theoretical curve would be lifted throughout. Thus the ‘‘bulges”’ 
were eliminated by ‘‘tuning’’ H, i.e., treating it as a free parame- 
ter subject only to the restriction H => H*, the a priori calculated 
uncertainty. 

Our result on the Perfect Learner indicates that there may be 
another reason for the bulges, namely the partial presence of per- 
fect-learner characteristics in the subjects. In fact the two curves 
tending to the same asymptote, namely the Perfect Learner’s curve 
above and the Constant Gain Learner’s curve below, determine a 
band in which all the experimental points of rote learning curves 
are expected to lie. 

Finally we note that the Constant Gain Learner with the same 
asymptote as the Perfect Learner in a single-repertoire matching 
task with N = 5, has a rate of information gain constant of k = z 
(log e 5!) = 1.044. The highest observed & for human subjects was 
(Livant, unpublished) about 0.6. The lowest observed & was 0.002 
(in monkeys). In fairness to the human subjects, it should be 
pointed out that all tasks examined were considerably more com- 
plex than the matching task with N = 5. In such tasks an average 
human subject will probably behave as a Perfect Learner. We 
conjecture that the limit for perfect rote learning in tasks of this 
kind is about N = 7. In fairness to the monkeys, it should also be 
pointed out that in the experiment referred to (Harlow, 1959), the 
monkeys had to make a rather sophisticated abstraction (sophisti- 
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cated for monkeys, that is), so that the task should not perhaps be 
considered a purely rote matching task. It is noteworthy, however, 
that the monkeys’ learning curve fitted the constant-gain learning 
curve of Landahl and Rapoport almost perfectly. 

Evidence for the conjecture that N = 7 is about maximal for per- 
fect learning appears in the data on human subjects presented by 
H. D. Landahl (1941). The matching tasks involved disjoint 
repertoires with only ‘‘right-wrong’’ reinforcements and with M = 
N =4, 8, and 12. The Perfect Learner should make on the average 
6, 28, and 66 total cumulated errors respectively (cf. our Equation 
20). The asymptotes in Landahl’s plots are (estimated) 1.5, 4, 
and 7 cumulated errors per stimulus, i.e., 6, 32, and 84 errors in 
all on the respective tasks. We see, therefore, that Landahl’s 
subjects behave like Perfect Learners in the smallest task, nearly 
so in the medium task, and depart significantly (toward more er- 
rors) in the largest task. 

It may be worthwhile, therefore, to recast rote learning theory so 
as to make the Perfect Learner’s curve the base line from which 
real learning curves are obtained by the introduction of parameters 
of memory loss or departures from the perfect ‘‘search’’ strategy. 
It must be noted, however, that the present method is applicable 
only where stimulus-response matchings are a priori equiprobable. 
If the task contains redundancies, the best guessing strategy is 
not obvious and presents a separate problem. 
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Detailed equations are given for the computation of aortic distensi- 
bility in the intact living human patient from measurements of systolic 
and diastolic arterial pressures, heart rate and cardiac output. From 
these equations, the aortic characteristics of a large series of normal 
men of different ages were computed. Comparing these results with 
measurements on excised aortas, a more pronounced trend toward in- 
creasing aortic stiffness with increasing age is evident in living than in 
dead aortas. Nor-epinephrine and exercise apparently cause the living 
aortas to constrict but to become more distensible. The same change 
occurs after 30 minutes of high spinal anesthesia. The ganglionic block- 
ing agents hexamethonium, pentamethonium, and tetraethylammonium 
usually cause the living aorta to become stiffer, presumably due to dila- 
tation. The aortas of patients with pulmonary disease usually react in 
different fashion to exercise or tetraethylammonium. The increased aortic 
distensibility that occurs with the hypertension induced by nor-epineph- 
rine or exercise acts aS a compensatory mechanism by decreasing sys- 
tolic pressure. 


Part I. Theory. Arecent theoretical analysis of the human systemic 
arterial system (Cope, 1960) has provided a method for the determina- 
tion of aortic distensibility in the living human patient. Using this 
method, it is possible to compute the aortic distensibility constant of 
a patient from the data obtained in the usual hospital cardiorespira- 
tory laboratory. The data needed for this calculation include sys- 
tolic and diastolic arterial pressures as obtained by brachial ar- 
terial puncture, heart rate, and cardiac output as measured by the 


*Opinions and conclusions contained in this report are those of the 
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direct Fick or dye dilution methods. The calculation of a patient’s 
aortic distensibility from this data consumes from 15 to 20 minutes 
using paper, pencil and slide rule, but requires only a few seconds 
if the problem is programmed on a small digital computer. Insofar 
as it has been possible to check them, the equations on which the 
method is based gives computations that are usually accurate 
within 5% (Cope, 1960). 

Interest has developed in the application of this method by var- 
ious hospital cardiorespiratory laboratories in cooperation with 
local digital computer centers. Such collaborations will allow 
quantitative evaluation of the distensibility of the aortas of the 
individual patients to be obtained from the usual cardiovascular 
measurements with little additional trouble or expense. These 
quantitative measures of the aortic distensibility should allow more 
comprehensive and accurate assessment of a patient’s cardiovascu- 
lar system than is now possible. 

Part I of this report extends the previous mathematical work to 
include a derivation of the equations used to compute the aortic 
distensibility constant in the living human. Part II discusses some 
of the physiological and clinical applications of the method. 

Since there is a change in shape of the aortic pressure-volume 
curve at 170 mm. Hg, it was necessary to derive separate equa- 
tions for P, £170 mm, Hg, and for P, > 170 mm. Hg. Since it was 
found empirically that P, = 0.85 P, max., the dividing line between 
the two sets of equations occurs at P, max. = 200 mm. Hg (P, = 
mean systolic pressure, P, max. = maximum systolic pressure), 


Case I. (P, £170 mm. Hg or P, max. < 200 mm. Hg.) 


From the general theory of the human systemic arterial system, it 
has been shown (Cope, 1960, equations (10) and (14)) that 


2a(P, - P,) + 2.30 (2aP, +b) 1 Mai fel ada 

d ( ) logio E 5 P| R’ (1) 
Ae Adee A 
tee BE) + a(P,” — P?) + b(P, - Pa), (2) 


where: 


P, is mean systolic pressure (by brachial arterial puncture) in 
mm. Hg. 

Pq is minimum diastolic pressure (by brachial arterial puncture) 
in mm. Hg. 
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P, is venous pressure in mm. Hg. 
N is heart rate in heartbeats per minute. 
1, is duration of systole in minutes (Cope, 1960, Figure 2). 
Tz is duration of diastole in minutes (Cope, 1960, Figure 2). 
V, 1s cardiac output in cc. per minute. 
F is total peripheral resistance in mm. Hg x minutes x (cc.)7?}. 


a and 6 are defined by the parabolic approximation to the pres- 
sure-volume curve of the aorta as follows: 


V =aP?+bP ie 
where: 


V_ is the aortic volume in cc, 
P is the aortic pressure in mm. Hg. 


By plotting a scattergram of the values of a and 6b from parabolic 
curve fits of the pressure-volume curves of isolated human aortas 
(Figure 1), it was found that 


a = (—3.06 + 0.9) x 107 (3) 


In a similar manner, it was found that (Cope, 1960) 


P, = 0.85 P, max. (4) 


P, max. is maximum pressure during systole (by brachial arterial 
puncture) in mm. Hg. 

T, and Tz may be obtained from the heart rate N from Figure 2 
from Cope, 1960. 

We now substitute both equations (3) and (4) in equations (1) and 
(2) to eliminate the variables a and P,, thus obtaining two new 
equations that contain P, max., which is easier to measure than Pe 
We then solve these two new equations simultaneously, making 6 
and R the unknowns. Only the solution for 6 is given here since Fk 
can be more directly determined in other ways. 

By these means, we obtain an equation for the aortic distensi- 
bility constant 6 as a function of experimentally measurable vari- 
ables of the human systemic arterial system, as follows: 


ts -T, 
» - |£ 1.(0.85P, max - Py) (5) 


A aie 
D T,(0.85 P, max — P,) 
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O 
Oj 5-24 Shee eet 
x |lO3 a 


FIGURE 1. Scattergram of coefficients of parabolic approximations of 
P-V curves of isolated human aortas. 


where: 


A =-3.0 x 107° {32.0 login | eee wemaxin Pel 
P,-P, 


2(0.85 P.max - ra + 2.80 logio [Peehanss =P), 
df» 


CG = —0,9 x10" 389-84 logig | oon sme Pee 
Pi) Py 
2(0.85 Psmax — Pal 
D =-3.0 x 107° [0.72 (Ps max)? fs P?] ite P,): 


E = a ~ 0.9 x 107? [0.72 (P, max)? - P32], 
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FIGURE 2. Scattergram of distensibility constant (b) vs. patient age 
for data on isolated dead aortas. 


Case II. (P, > 170 mm. Hg or P,max. > 200 mm. Hg.) 


From theory, it has been shown that (Cope, 1960, equations (15) 
and (16)) 


170. as 
2a(170 — P;) + 2.30(2aP, +b) 1 7 6 
a( d) + (2aP, + 5) logio (oe! . (6) 
and 
Voor i ets 2 2 
eps Peal A) sb 71 0i— B;): (7) 


We now proceed as in Case I to substitute equations (3) and (4) 
in equations (6) and (7) to obtain two new equations which are 
then solved simultaneously for 6 and R. The resultant equation 
for 6 is as follows: 


Le ee 
5. |¥_T(0.85 Psmax ~ Py) (8) 


F -T, 
H T,(0.85 P,max — P,) 
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where 


FP a -3.0 1078 39.98 lop |e oe 
Pa Py 
170 — P 
2.30 1 fara y Ms 
ote | 


G = 0.9 x 107? 132.24 log, | =e) 0070 - PD}; 
Fa oa | 


H==3.0.%10- 170" = Pebie (iG ae), 


i ~ ~ 0.9 x 1078 [(170? - P?)). 


Hence, one may compute the aortic distensibility constant 6 
using equation (5) or (8) depending upon whether peak systolic 
pressure P,max. is below or above 200 mm. Hg. The computations 
are tedious when done by hand but are completed in seconds when 
programmed on even the smallest digital computer. The computer 
program must of course include the instruction to make the choice 
between equations (5) and (8) depending on the value of P,max. 


Part II. Physiological and Clinical Applications. This section 
will review briefly the characteristics of the isolated human aorta 
and its relation to the development of the theory, and then will 
compare computed characteristics of the intact living aorta with 
the experimentally-determined characteristics of isolated dead 
aortas, Finally, various physiologically and pharmacologically in- 
duced variations in the aortic elastic characteristics in the living 
human will be shown and their effects on blood pressure will be 
demonstrated. 

The development of the theory mentioned above required the as- 
sumption that the entire aorta expanded and contracted simultane- 
ously in all its parts. For this type of analysis it was necessary 
to know the elastic characteristics of the aorta in the form of an 
equation giving the volume of the entire aorta as a function of 
aortic pressure. There are available in the literature two reason- 
ably comprehensive studies of the pressure-volume P-V curves of 
the isolated human aorta (Hallock and Benson, 1937; Remington 
et al., 1948), both of which show the age of the patient to have a 
significant effect on his aortic elastic characteristics. Hallock 
and Benson (1937) inflated segments of descending thoracic aorta 
with isotonic NaCl solution, and found a progressive increase in 
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both baseline volume and in stiffness with increasing patient age. 
Remington e¢ al. (1948) measured force-length curves on a series 
of rings cut from different sections of the aorta, and then com- 
puted a P-V curve for the entire aorta on the assumption that there 
was no longitudinal stretching. They found a progressive increase 
of baseline volume as patient age increased, but no significant 
change in the shape of the P-V curve as a function of age. To the 
present author, the reasons for the discrepancies between these 
two studies were not at first apparent. 

Later, as a side issue in connection with some other experi- 
ments, the present author had occasion to inflate stepwise a series 
of ten isolated segments of human descending thoracic aorta and 
noted incidentally that inflation from 0 to 200 mm. Hg pressure 
caused a lengthening of 4 to 19%. Even if the aortic ends were 
held a constant distance apart, as was apparently done by Hallock 
and Benson (1937), a lengthwise stretch still occurred because in 
every case the aortic segment under study bowed outward to one 
side or to the other. Hence, in effect, the measurements of Hallock 
and Benson (1937) allowed lengthwise stretch but those of Reming- 
ton et al. (1948) did not. Since the shape of the longitudinal aortic 
force-length curve varies markedly with age (Krafka, 1940), this 
explains the discrepancy. 

The human aorta in situ is apparently held under considerable 
longitudinal tension, since when it is cut loose it shortens by 2 to 
15% (Remington e¢ al. 1948). In the living dog, length changes 
were much smaller than, and 180 degrees out of phase with, the 
diameter changes (Lawton and Greene, 1956). Hence, it seemed 
likely that aortic length changes in situ were small, so that the 
data of Remington e¢ al. (1948) were taken to be the best available 
approximation to the aortic P-V curve in the living human. 

In order to use the data of Remington et al. (1948), it was neces- 
sary to convert them from pressure vs. volume/meter” of body sur- 
face to the form of pressure vs. volume. Then, all the volumes at 
each pressure were averaged to give age group P-V curves. The 
P-V curves were roughly parabolic in shape, so the equation V = 
ap? + bp +c (a, b, c are constants) was fitted to the P-V data for 
each individual and each age group. This was accomplished by 
substituting the pressures and volumes at 80, 140, and 170 mm. Hg 
in the above equations and solving the resultant three equations 
simultaneously for a, 6 and c. The actual computation was done on 
a Bendix digital computer. Table 1 gives the values of a, 6 and c 
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TABLE 1 


COEFFICIENTS OF PARABOLIC APPROXIMATION OF PRESSURE- 
VOLUME CURVES OF ISOLATED HUMAN AORTAS* 


Age Group Means 
(years) 


18-33 to* 

34-46 —5.3847 x 1078 

49-58 —4.981 x 1078 

63-89 —3.317 x 1078 

hypertensive —5.533 x 1078 

Patient 

2 18 (F) —2.944 x 1078 +1.352 +38.75 
3 22 (F) —3.864 x 1073 +1.807 —52.38 
4 22 (M) —4.608 x 1078 +1.836 —37.01 
5 24 (M) —4,244 x 1078 +1.521 +0.82 
6 26 (M) —7.568 x 1078 +2.654 —52.77 
7 27 (M) —4,230 x 1078 +2.022 -16.54 
8 29 (F) '—1.467 x 1078 +0.860 +3.13 
9 30 (F) ~7.382 x 1078 +2.630 -79.88 
10 33 (F) —7.752 x 1078 +2.745 —83.72 
11 34 (M) —2.764 x 1078 +1.128 +67.51 
2 38 (M) —11.074 x 1078 +3.912 —131.06 
13 40 (F) — 0.920 x 1078 +0.420 +97.48 
14 41 (M) —1.448 x 1078 +1.528 +39.96 
1S 42 (M) —12.924 x 1078 +4.419 — 157.85 
16 44 (M) —7.012 x 1073 +2.515 +20.41 
uy 45 (F) —0.920 x 1073 +0.421 +86.54 
18 45 (M) —5.353 x 1078 +1.865 — 14.69 
19 46 (M) —6.079 x 1078 +2.562 —48.01 
20 46 (M) —5.896 x 1078 +2.438 +61.19 
21 49 (F) —4,.982 x 1078 +1.817 +17.20 
22 50 (M) —2.766 x 1078 +1.061 +76.86 
23 50 (M) —6.270 x 1078 +1,242 +23.33 
24 50 (M) — 4.058 x 1078 +1.497 +93.03 
25 52 (F) —3.690 x 1078 +1.348 +2.12 
26 53 (M) —2.208 x 1078 +1.023 +94.27 
27 55 (F) —5.347 x 1078 +2.065 —4.93 
28 55 (M) —6.646 x 1078 +2.300 +11.42 
29 58 (M) —10.525 x 1073 +3.539 —20.20 
30 58 (F) —5.536 x 1078 +1.989 +45.14 
31 63 (M) —1.473 x 1078 +0.659 + 123.63 
32 65 (M) —2.208 x 1073 +1.023 + 72.39 
33 68 (M) -5.897 x 1078 +2.408 —34.55 
34 68 (M) —2.024 x 1078 +0.932 +113.29 
35 71 (M) —8.494 x 1073 +2.874 +59.80 
36 76 (M) —2.210 x 1078 +0.956 +186.14 
37 81 (M) —2.947 x 1078 +1.252 +192.97 


38 89 (F) —4.430 x 1078 +1.545 +66.71 
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TABLE 1 (CONT.) 
Patient 
(age & sex) 


*Calculated from the data of Remington, et al. (1948). 


for all of the individual cases and age groups of Remington et al. 
(1948). It is evident that c, which is an index of baseline volume, 
increases considerably with age. A scattergram plot of a vs. 6 
(Figure 1) revealed that the points were very close to a straight 
line so that one could compute a =(-3.08 + 0.9) x 107° with only 
slight error. In this paper, only values of 6 are given, since @ can 
always be determined from 6 by the above equation. The quantity 
6b serves as an index of aortic distensibility. A slight trend to- 
ward decreasing 6 (decreasing distensibility) with increasing age 
can be noted in the age group means in Table 1. 

Assuming that the aortic P-V curves were parabolic, and that a 
direct proportionality existed between pressure and flow through 
the peripheral vascular system, it was possible to develop equa- 
tions relating the physical variables of the systemic arterial sys- 
tem. For details, see Cope (1960). The final equations were used 
to compute values of 5 in the living human for a series of 68 normal 
males of a variety of ages from data on systolic and diastolic ar- 
terial pressures, pulse rates and cardiac outputs. These data are 
taken from Brandfonbrener, Landowne and Shock (1955), and Lan- 
downe, Brandfonbrener and Shock (1955), and the complete raw 
data was kindly supplied by Dr. Shock. Venous pressure is as- 
sumed to be 7 mm. Hg for these and all other calculations in this 
paper. Calculation of } from systolic and diastolic blood pres- 
sures, heart rate and cardiac output was performed by solving 
equations (10) and (14) in Cope (1960) simultaneously for 5 and 
peripheral resistance. Then, 257 sets of values of the various 
variables were substituted in the resulting equation and values of 
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6 were calculated for these 257 different situations. The computa- 
tions were straightforward and could easily have been done by 
hand, but a Bendix digital computer was utilized to save time. 

Table 2 shows values of 6 in 68 normal living patients, and age 
group means computed from the data of Brandfonbrener e¢ a/. (1955) 
and Landowne et al. (1955). The age group means for the living 
patients showed a much more pronounced trend toward smaller 6 
(decreased distensibility) with increasing age than is evident for 
the isolated aortas. The same difference can be seen by com- 
paring the scattergrams of 6 vs. age for isolated dead and intact 
living aortas (Figures 2, 3). A fairly clear negative correlation 
between 0} and age is evident for the living aortas but not for the 
dead aortas. It is the author’s speculation that increasing blood 
pressure in the aging patient causes active relaxation of aortic 
smooth muscle via the carotid and aortic reflexes, thus making the 
aorta larger but stiffer. 

The development of a method for the calculation of aortic elastic 
characteristics in the living human paved the way for a study of 
possible physiological and pharmacologically induced changes in 
this variable. Such studies, to the author’s knowledge, have sel- 
dom been done in animals and have never been done in humans. 
Wiggers and Wegria (1938) found that when hypertension is induced 
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FIGURE 3. Scattergram of distensibility constant (d) vs. patient age 
as computed for intact living aortas. 
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TABLE 2 


COMPUTED VALUES OF b FOR INTACT LIVING AORTAS 


Age Group Means Patient Patient 
ae b 
(years) Initials Age 


20-30 C,E 2.26 
30-40 Gal 1.50 
40-50 C,A 1.10 
50-60 D,W Levb 
60-70 26: 1.82 
70-80 P.R 0.77 
80-90 F,C 3.92 
H,E 1.72 
N,J 2.17 
Patient Patient H,C 0.89 
J,C 19 wae ee 
W,T 19 eee : 
R,E 21 A,J 0.86 
N,C 22 D,R 1.48 
K,R 23 J,c 2.04 
P,C 25 J,M 0.83 
B,G 27 W,J 1.38 
H,F 27 G,M 1.72 
B,J 29 C,G Dee 
S,W 30 ais eee 
J,0 31 be 
S'A 4 B,C 1.62 
M,R 33 DyJ oy 
P,O 33 a Ls 
J,A 35 M,H We 
J,R 35 O,T ie 
P,T 35 T,A pips 
W,A 39 Dee 1.48 
W,E 39 B,L 1.59 
S.C 0.66 
M,M 40 W.C 1.58 
DJ 41 : 
K.E 41 E,Ss 1.74 
Pil i MJ 2.37 
TC i OG 0.68 
Ba ay RD 1.82 
oe rc TH 0.65 
pie Zs LW 0.82 
a 7 BT 1.10 


in a dog by stimulation of the central end of the vagus, or by giving 
epinephrine after vagotomy, that the aorta contracts and decreases 
in radius, but becomes more distensible. Alexander (1954) found 
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that epinephrine caused this same effect in the isolated aorta of 
the living dog. Apparently, in the contracted aorta, the muscle has 
shortened sufficiently to remove tension from some of the elastic 
tissue, so that the aortic elastic characteristics are determined 
more by the distensible muscle and less by the stiffer elastic and 
fibrous tissue. This phenomenon is discussed in terms of vessel 
wall structure by Burton (1954). 

Using the published data of Goldenberg e¢ a/. (1948), it was pos- 
sible-to compute the aortic distensibility constant 6 in a series of 
patients before and after the administration of nor-epinephrine 
and/or epinephrine. As shown in Table 3, in 7 of 9 patients, nor- 
epinephrine caused increases in 6 (greater distensibility). How- 
ever, epinephrine produced no consistent effects. The author in- 
terprets this as meaning that nor-epinephrine causes a constriction 
of the aorta with increased distensibility, as was found by Wiggers 
and Wegria (1938) in the dog given epinephrine. 

From the data of Riley e¢ al. (1948), values of 6 were computed 
for 3 normal subjects and 2 patients with pulmonary pathology, 
while at rest and for mild and severe exercise, as shown in Table 
4. Oxygen consumption is shown as an indication of the severity 
of the exercise. It is evident that exercise caused large increases 
in 6 (greater distensibility) in all three of the normal subjects but 


TABLE 3 


VALUES OF b BEFORE AND AFTER NOR-EPINEPHRINE* 


Patient Before nor- 
(Age) epinephrine 


Patient 
(Initials) 


After nor- 
epinephrine 


1.53 1.77 
C.G. 3.20 4.29 
B.H. 3.62, 2.49 5.33, 5.66 
S.T. 2.20, 1.89 2.57, 3.38 
F.P. 2.52, 2.60 3.11, 3.40 
E.B. 3.60, 4.42 2.94, 5.34 
Ta) 3.49 4.03 
TR: 1.95, 2.02 3.52 

5.99 5.46 


*All patients were normal males except first one who was female and 
hypertensive. 
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TABLE 4 


VALUES OF 6 BEFORE AND AFTER EXERCISE 


Oxygen Consumption 
(ml/min/m?b,s.) 


Mild Sever 
Rest : : a 
exercise | exercise 


1 


Value of b 


Patient 


No. Condition 


normal 


378 normal 
379 normal 
366 pneumonec- 1.86 
tomy for 
tbe 
372 | pulmonary 
fibrosis, 
emphy- 


sema 


no consistent change in those with pulmonary pathology. The ef- 
fect of exercise on the aorta is apparently the same as that of nor- 
epinephrine in normal subjects. The absence of a consistent effect 
in the patients with pulmonary disease might be due to the fact 
that their exercise was relatively mild, as judged from oxygen 
consumptions. 

The effects of the ganglionic blocking agents hexamethonium 
and pentamethonium on aortic elastic characteristics was studied 
by computing 6 in hypertensive patients before and after receiving 
these drugs from the data of Grob e¢ a/. (1953). In 3 of the 4 cases 
studied, the ganglionic blockers caused moderate decreases in 6 
(decreased distensibility), as shown in Table 5. The author inter- 
prets this as probably being the opposite of the nor-epinephrine 
effect, i.e., the aorta becomes dilated and less distensible, due to 
blocking of the sympathetic ganglia. This would result in decreased 
nor-epinephrine production, thus allowing relaxation of the aortic 
smooth muscle. 

Another ganglionic blocking agent, namely, tetraethylammonium, 
was found to have a similar effect on the aorta. Computations of 6 
from the data of Fowler e¢ al. (1950) show that in 5 of 7 normal or 
hypertensive subjects, administration of tetraethylammonium was 
followed by a decrease in 6 (decreased aortic distensibility), as 
shown in Table 6. The cause of this effect would seem likely to 


350 FREEMAN W. COPE 
TABLE 5 


EFFECTS OF HEXAMETHONIUM AND PENTAMETHONIUM ON b* 


Value of 6 


Patient 
Before Drug 15 min. After Drug 


*All patients are lying down. 


be the same as in the case of hexamethonium and pentamethonium. 
However, in 3 of 5 patients with pulmonary disease, tetraethylam- 
monium had the reverse effect, namely, an increase in 6. The au- 
thor is at a loss to explain this difference, but it is interesting to 
note that the aortas of patients with pulmonary disease also reacted 
abnormally to exercise. 


TABLE 6 


EFFECTS OF TETRAETHYLAMMONIUM ON 6 


After Drug 
cor pulmonale 
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The effects of spinal anesthesia on the aorta were studied by 
calculation of 6 from the data of Sancetta e¢ al. (1952). As can be 
seen from Table 7, there is a significant rise in } in 4 of 4 cases 
after 30 minutes of high spinal anesthesia. No consistent pattern 
of change of 6 can be seen after 60 minutes of high spinal anes- 
thesia or with low spinal anesthesia. 

A long series of calculations of b were performed for patients 
subjected to hypoxia or heat from the data of Doyle e¢ al. (1952) 
and Sancetta et al. (1958), but no consistent pattern of change of 
6 due to either of these stresses could be detected. 


TABLE 7 


EFFECTS OF HIGH SPINAL ANESTHESIA ON }b 


Value of 6 


Patient Before After 30 min. After 60 min. 
Anesthesia of Anesthesia of Anesthesia 


When a patient is subjected to nor-epinephrine or exercise, 
either of which raises arterial pressures, the change in 6 is in the 
direction of increased distensibility. If other things remain con- 
stant, this change lowers the systolic pressure, although diastolic 
pressure is raised. Hence, changed aortic distensibility apparently 
can serve aS a compensatory mechanism in situations involving 
high systolic pressures. To determine the magnitude of this com- 
pensation, the systolic and diastolic pressures which would have 
occurred if the compensatory change in 6 had not occurred were 
computed from the general equations of the systemic arterial sys- 
tem. This technique was applied to the cases of nor-epinephrine 
administration of Goldenberg et al. (1948) and those of exercise of 
Riley et al. (1948), These values for the hypothetical pressures 
before compensation are shown in Table 8. It is evident that in 
every case, the increase in 0 lowered systolic pressure and raised 
diastolic pressure. 
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TABLE 8 


ELEVATION OF b AS COMPENSATION FOR HIGH SYSTOLIC PRESSURE* 


Under Treatment 


b 
Patient (aortic distensi- Maximum systolic and 
initi bility coefficient i i ssures 
ances Weentnient 1lity co ) diastolic blood pressu 
nee As computed, if 
in years) As actually 


6 had remained 
at its rest value 


Bevoe  ucr-epiuepne 160/84 182/65 
rine 


At rest measured 


B.V., 25 Leo 5.66 | 156/85 178/64 
S.T., 48 pe 168/92 173/86 


*Data for computations on patients subjected to exercise was taken 
from Riley et al., 1948. Data for nor-epinephrine came from Goldenberg 
et al., 1948. 
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Dimensional analysis is discussed from the viewpoint of its basic 
group properties and shown to be an algebraic Abelian group that is use- 
ful for analysis of physical measurements. The application of the method 
to various types of equations and the formulation of previously unclassi- 
fied dimensions are discussed. Functional dimensional analysis is ap- 
plied to the problems of cell size and biomass proliferation; future ap- 
plications are also noted. A number of dimensionless terms have been 
formulated for cellular physiochemical phenomena. They apparently 
represent the first systematic study of biological dimensionless numbers 
recorded in the literature. A dimensionless proliferation law is sug- 
gested, A brief analysis of the physical dimensionality associated with 
information measures is carried out. Entropy and ‘‘information’’ are 
shown to be completely different in their dimensional meaning; other in- 
formational measures of possible interest in biology are proposed. The 
dimensional coding and computor analysis of biomathematical equations 
is suggested. 


1. Introduction. Most persons familiar with the term dimensional 
analysis consider it to be little more than a systematic method for 
checking the consistency of equations and converting systems of 
units. However, careful examination of the technique and its his- 
tory reveals that in actuality dimensional analysis is a complex 
and interesting branch of algebraic theory, with a very broad po- 
tential range of application. 

Newton (1686) referred to the method as the ‘‘Great Principle of 
Similitude.’? Fourier (1822) made important contributions to di- 
mensional analysis during his study of heat flow and conclusively 
established the requirement that physical equations be dimension- 
ally homogeneous. Lord Rayleigh (1915) was a strong protagonist 
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of the technique and applied it to a variety of problems, including 
light scattering in small water droplets, the optical phenomenon 
which is responsible for the blue color of the sky. Dimensional 
analysis advanced strongly with the growth of modern hydrody- 
namics, through the work of Reynolds and many others. It was 
known to J. Clerk Maxwell and probably played a substantial role 
in the formulation of Maxwell’s equations. P. W. Bridgman (1922) 
published one of the more thoughtful early analyses of dimensional 
analysis and demonstrated that it could be applied with profit to 
problems of basic physics such as gravitation. 

More recently important progress was made by G. D. Birkhoff 
(1960) who explored the group properties of dimensional analysis 
in connection with differential equations. He later subsumed this 
technique under an even more powerful general approach, which is 
designated ‘‘inspectional analysis’’ and is based on the search for 
general symmetries and elementary group properties in physical 
situations, and particularly in differential equations. 

Academician L. I. Sedov (1959), widely held to be responsible 
for important progress in the Soviet rocket and astrophysics pro- 
gram, has published a definitive volume on similarity and dimen- 
sional methods in mechanics. Persons who suppose dimensional 
analysis to be essentially a trivially simple matter need only glance 
at a few pages in the works of Birkhoff or Sedov to learn other- 
wise. The Russian literature also contains volumes by D’yakonov 
(1956), Nogid (1959), Kirpichev (1953), Venikov (1956) and Reznya- 
kov (1959) on applications of dimensional analysis to problems 
ranging from electrical analog models to foaming of liquids, physio- 
chemistry and thermodynamics. The French scientist Esnault- 
Pelterie (1950) successfully applied dimensional methods to mete- 
orology. From the practical engineering standpoint dimensional 
analysis has innumerable uses in problems of heat and fluid flow, 
aerodynamics (von Karman, 1954), design of electrical machinery, 
etc. Langhaar (1951), Murphy (1950) and other authors offer a good 
introduction to such applications. Dimensional analysis has also 
been applied to problems in magnetohydrodynamics (Cowling, 1957) 
and rheology (Eirich, 1956) and is sometimes the only usable 
analytic method in these fields. 

It is thus rather surprising that dimensional methods are so little 
understood in the biological sciences. One may well agree with 
Lord Rayleigh in his statement: ‘‘I have often been impressed by 
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the scanty attention paid even by original workers in the field of 
physics to the great principle of similitude. It happens not infre- 
quently that results in the form of ‘laws’ are put forward as nov- 
elties on the basis of elaborate experiments, when they might have 
been predicted a priori after a few minutes’ consideration.’ It will 
be shown below that this conclusion also applies to problems of 
biomathematics. 

The writer believes the present article to be the first pub- 
lished systematic application of dimensional analysis in biology, 
in which use is made of dimensionless numbers and functional 
dimensional equations. D’Arcy Thompson (1959), author of On 
Growth and Form, applied dimensional methods in the study of 
gross phenomena such as flying, swimming and walking. He 
deduced certain scaling laws, such as those relating mass and 
running speed or wing area and flying speed, but worked be- 
fore the advent of modern biophysics. In his encyclopedic recent 
work on mathematical biology, Rashevsky (1960) applies dimen- 
sional analysis for checking certain differential equations, in the 
formulation of constants and also in derivation of some relation- 
ships, such as those governing bending of tree branches, but he 
does not provide any systematic discussion of biological dimen- 
sions or dimensionless numbers. Dimensional considerations are 
also used by Giinther & Guerra (1955, 1956) for analysis of heart 
and respiratory rate as a function of animal weight. They have, of 
course, entered indirectly into a great many other biomathematical 
works. 


2. Methods of Applying Dimensional Analysis. In order to dem- 
onstrate how dimensional analysis may be applied in biomathe- 
matics it is best to provide a brief series of examples from the 
physical sciences. Assume that a formula is to be derived for the 
velocity of sound in a gas, knowing from Sepouuente) work that 
the only significant variables are the velocity v = 2 L/T, the pres- 
sure P2M/LT®? and the density p £M/L*. The usual MLT basis 
of dimensions is used and Table 1 provides a list of variables 
found in the text. Part II contains a more extended tabulation of 
variables with possible biomathematical applications. The symbol 
‘<4? stands for ‘‘equals dimensionally.”’ Simple inspection of the 
variables reveals that they can be combined in the relationship: 


= P/p. (1) 
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PHYSIOCHEMICAL VARIABLES 


Variable Symbol (s) Dimensionality 


Mass M 
Time If , 
Frequency thes 
Length (cell radius, etc.) L 
Velocity lids 
Acceleration (gravitational) L/T? 
Area L? 
Volume L3 
Density, mass concentration M/V 
Concentration N/V 
Rate of density change M/VT 
Synthesis rate of a substance N/VT 
Mass flow rate M/AT 
Flux (molecules, photons, etc.) N/AT 
Diffusivity coefficient L2/T 
Permeability coefficient for concentration L/T 
gradient 
Surface energy a M/T? 
Viscosity, coef. rigidity | M/LT 
Kinematic viscosity v L4/T 
Stress tensor, pressure (F'/A) fens e x M/LT? 
Osmotic pressure 7 M/LT? 
Force F ML /T? 
Energy E ML2/T2 
Temperature é 6 
Reaction constants K T-1, N/T, N/T? 
Molal gas constant R E/0 


Ion motility 


o 

= 
= 
<a 
4 


The dimensional identity symbol* I 
A number or numerical function n, f(n) I 
Molecules, moles N 


*Can be used for any dimensionless number, dimensional identity or 
real number. See text. 


Except for a numerical constant, which varies with the particular 
system of units used, this is the correct demonstrated relationship 
between the variables in question. It must be stressed that such 
an analysis is possible only when one knows or has strong reasons 
for hypothesizing which variables are pertinent for a given problem. 

Poiseuille’s law for fluid flow in pipes may be derived similarly 
and is a good deal more complex. The variables of interest are: 
volume flow rate Jy £1,3/T, pressure gradient p £M/L?T?, radius 


DIMENSIONAL ANALYSIS 359 


r2L, and viscosity 7 £M/LT. These variables may be combined 
in 


J =pr*/n. (2) 
Poiseuille’s actual ‘‘law’’ (Bull, 1951) is: 
J = (n/8) (AP/L) */n, (3) 


where a geometric factor is added and pressure gradient is given 
explicitly. 

A surprising number of basic equations of physics and mechanics 
may be derived in this manner, as shown for example by Huntley 
(1953). This somewhat unexpected result is based on the fact that 
dimensional ‘“‘laws’’ express relationships which are independent 
of scale of measurement. Newton’s laws must certainly hold 
whether metric or English units are used. Stated more formally, 
this requirement is equivalent to requiring invariance for an equa- 
tion under the scale change transformation 2’ = kz, in which z is a 
general variable and & an arbitrary scaling factor. This substitu- 
tion is part of the general affine space motion group, as was pointed 
out many years ago by Felix Klein (1939). Dimensional equations 
and dimensionless numbers may be obtained from differential equa- 
tions using this technique, as is demonstrated below for the funda- 
mental Navier-Stokes equation of hydrodynamics (Langhaar, 1951): 


pg cos & dV — dp/dx dV +7 V* dV — p(udu/da + vdu/dy + 
wou/dz)dV =0. (4) 


New variables include &, the angle between flow lines (dimension- 
less); z,y,Z, the lengths; u,v,w, the velocities. This equation rep- 
resents, in order, gravitational pressure, viscous flow, and inertial 
resistance to flow forces. By making a dummy substitution for new 
units of measurements, in the form 2’ = La, p’ = Pp, etc., one ob- 
tains dimensional equivalents for each term and the equation is 
also found to be dimensionally homogeneous. Dimensionless num- 
bers are obtained by making ratios of these terms. The well-known 
Reynold’s number is the ratio of inertial to viscous forces and is 
given by 

Ip =vL p/n. (5) 


It has been established that for fluid in a pipe if the numerical 
value of the Reynold’s number exceeds about 2000, turbulence is 
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likely to occur. From the above, dimensional analysis might be 
said to be an ‘‘algebra of scaling factors.”’ 

The ability to get dimensional and dimensionless relationships 
from differential equations is of great importance when the latter 
are too complex for an analytic solution, as is often the case in 
biomathematics. The next example illustrates this point. An ana- 
lytic relationship is desired for the factors which influence thrust 
developed by a ship propeller in deep water. Experience indicates 
that the pertinent variables are: thrust 7 4ML/T*, diameter of 
propellor D, revolutions per minute n 2T~', velocity of advance 
v, gravity g, density p, and kinematic viscosity v (y/p). Combining 
these variables by the systematic methods of dimensional analysis 
(Buckingham Pz theorem) one obtains the following four dimension- 
less terms: 


1, = T/pD*v?; 1, =Dn/v; I, = Dv/v; 1q = Dg/v?. (6) 


The propeller problem is then formulated in the following functional 
_ equation 


F as Fe Up Iq) =0 


in which each /; term stands for a real number and the relationship 
is to be found empirically, deduced in part from boundary condi- 
tions, etc. (Keller and Doherty, 1961). This problem could have 
been approached solely by the method of differential equations, but 
it has not been established that a satisfactory set of equations can 
be written and the solution of such a system would be very difficult 
and perhaps impossible analytically. Considerable progress can 
be made by the dimensional methods, as shown. The original 
seven variables are reduced in number to four combined variables, 
which express certain particular aspects of the physical situation. 
For instance, /, is readily understood as the ratio of thrust to lat- 
eral inertial forces resisting rotation of the propeller. 

It is sometimes said that the dimensional method is completely 
formal and lacks any internal theory. The writer would not agree, 
because derivation of suitable dimensionless terms yields major 
insight into the physical nature of any particular problem. Also 
it allows a systematic study of how the most important variables 
influence each other. One may note that, after all, dx/dt =v, and 
a rate or acceleration may be handled as an algebraic entity in 
many cases. The above is certainly not meant to detract from the 
value of calculus, but merely to stress that dimensional analysis 
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has a contribution to make to almost any problem and especially 
those where analytic methods fail. 

As a final illustration, and one which is very interesting from 
the biological viewpoint, there is the derivation for pulsation fre- 
quency of a fluid droplet offered by Lord Rayleigh (1915). The 
droplet is suspended from the end of a tube immersed in a medium 
of rather similar density, and subjected to a weak mechanical dis- 
turbance. Variables include only f,o,p and radius r. These com- 
bine to give 


f? =0/r*p. GC) 


The relationship was confirmed experimentally by Lord Rayleigh. 
Pulsation frequencies have been a matter of interest in biomathe- 
matics for many years. 


3. Biological Dimensions. Study of the available lists of physi- 
cal dimensions, such as those given by Langhaar (1951) and Thun 
(1960), reveals them to be somewhat arbitrary and not complete, in 
the sense of not using every possible combination of integral ex- 
ponents (up to values of + 4 perhaps). This raises the question as 
to why all combinations are not in use and what logical criteria 
must be applied in deriving new dimensional terms. It is shown 
below that dimensional analysis is an independent algebraic struc- 
ture, obeying the properties of an infinite, denumerable Abelian 
group. Dimensionless ‘‘numbers’’ are then found to be cyclic 
transformation subgroups. 

Let A,B,C,...N be the basic elements of the dimensional set. 
The general compound dimensional variable is given by D = A"B™C? 
...N” in which the exponents take on positive or negative integral 
values. DD~' =I, where / is the identity element. The associa- 
tive and commutative laws hold: AB =BA and A(BC) = AB(C). 
Since there is no addition nor subtraction there are, of course, no 
distributive laws. Examination of the dimensional set reveals that 
the elements do not stand for real numbers, because they can only 
be combined in a more limited way, but they may be replaced by 
real numbers when the latter are related solely by product or quo- 
tient operations. Klein (1939) points out that the Greeks distin- 
guished between numbers used only in ratios (Ao’yoc) and the nat- 
ural numbers (ap:On0¢). It is of some interest that the usual sym- 
bols of calculus, dz/dt, AY, etc., behave more like dimensional 


elements than real numbers. 
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If P = AB/C and R =DE, then PR = ABDE/C, etc., because of 
associativity. From this point of view the question of ‘‘basic”’ 
dimensions becomes somewhat arbitrary. It is easily shown that 
if MLT is the accepted basis in Newtonian mechanics, a completely 
equivalent system may be built up from m =ML/T, v =L/T and T, 
or many other such independent triplets of terms. Dimensional 
algebra also contains equivalent classes, congruences on simple 
moduli and isomorphisms. These formalisms will be explored else- 
where in connection with physical and particularly analog modeling 
methods. 

From the viewpoint of the above theory, dimensional analysis is 
clearly a group; this same conclusion was reached by Birkhoff 
(1960) on the basis of invariance for the scaling substitution, but 
was developed in a different manner. Therefore any arbitrary com- 
bination of dimensional elements, such as D = M/L°T, is a “‘di- 
mension”’ and may turn out to be useful at some future time. It may be 
stressed that in an algebraic group elements represent an operation 
and in this instance the operation is multiplication or division of ini- 
tial numerical values to give additional derived numerical values. 
Viscosity could not be used until a means was found for obtaining 
the value of viscosity from elementary measurements. The Reynolds 
factor represents a complex dimensionless factor that characterized 
a complex property of a physical situation, i.e., its susceptibility 
to turbulence, 

The parameters by which a given property such as viscosity are 
measured may change, but the final result will be the same under 
given conditions. Therefore a dimensional quality represents a 
numerical invariant physical property of some system, just as V = 
L,L»4,L,. represents an invariant property. Measurement involves 
finding a point on a certain measurement axis. Each dimension 
may thus be said to represent a dimensional axis; the geometric 
interpretation of this statement will be pursued elsewhere. 

The Appendix to Part II contains a preliminary listing of pro- 
posed dimensions which may be of use in biomathematics; many 
additions will no doubt be made. When combining dimensional 
terms into an equality, which the writer uniformly designates by 
the identity symbol /, the question arises as to whether a dimen- 
sional identity is also a numerical identity. Since dimensional 
elements do not represent ordinary real numbers, a dimensional 
identity does not automatically define a numerical ‘‘law.’? The 
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range of conditions under which a dimensional identity is a nu- 
merical identity must be determined experimentally or from other 
theory, but in any case dimensional identities are highly sugges- 
tive. It is very significant that many basic laws of physics, such 
as F =MA, are also elementary dimensional statements. To illus- 
trate the provocative results of dimensional analysis, the writer 
suggests that adenosine triphosphate (ATP) may represent action 
as well as energy dimensionally. The matter remains to be settled 
on the basis of actual measurements of force, momentum, energy 
and time constants during contraction of muscle, but the dimension 
of action can at least be considered in muscular movement. It has, 
of course, played a most important role in physics and in charac- 
terizing atomic and molecular quantized states. 

It should now be evident that dimensional analysis is not nearly 
so simple as appears on first examination. The rather detailed dis- 
cussion provided above was necessary to demonstrate how dimen- 
sional analysis has been used in the past, but can only serve to 
introduce the subject. The interested reader is referred to one of 
the several excellent treatises on the subject, such as those of 
Langhaar (1951), Bridgman (1922), Birkhoff (1960), Sedov (1959) 
and others. 

4, Examples of Dimensional Analysis in Biology. This section 
will draw principally on examples from only two areas of biomathe- 
matics: cellular physiochemical problems and proliferation of bio- 
mass. Other topics, such as dimensionless numbers in general 
physiology, form similitude, muscular action, population growth 
and ecology, nervous system functions, etc., will be pursued in 
future continuations of this study. As is well known to students 
of mathematical biology, Rashevsky’s earlier writings, as con- 
tained in chapters 1-27 of his recent two-volume edition and also 
reviewed elsewhere by Rashevsky (1958) and Reiner (1960), deal 
with elementary cell properties as a function of concentration of 
various substances, diffusion and permeability factors, coupled 
chemical reactions, synthesis rates, etc. In general, the results 
obtained are given as solutions to differential equations, in the 
form of equations utilizing many constants which must be found 
experimentally. 

Table 2 presents a series of relationships containing the vari- 
ables listed in Table 1. Each /; represents a dimensional identity 
and potentially useful dimensionless number. This method of 
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identification is preferable to designation of each identity by some 
letter, as FR or ‘‘Re’’ (for the Reynold’s number), because of the 
large number of identities. Additionally, the /; terms may be com- 
bined with each other to yield still other derived relationships, as 
needed, Some of the combinations may appear trivial, but this 
simplicity is more apparent than real, as shown by equations (6). 
Even the combination / = V/L® is of use in studies of pipe rough- 
ness and is called the ‘‘fineness coefficient;’? J = L/vT turns out 
to be of interest for problems of resonance and is called the ‘‘Strou- 
hal’’ or ‘‘homochronism’’ number. Similar lists of dimensionless 
terms relating to fluid dynamics and chemical engineering are 
available (Boucher and Alves, 1959; Klinkenberg and Mooy, 1948). 

To the relationships given in Table 2 should be added a number 
of so-called ‘‘characteristic’’ ratios, such as C/C*, T/T*, etc., 
in which the starred value is some convenient reference point in 
the system. This is the same as expressing the parameter of in- 
terest as a multiple or percentage. As examples of characteristic 
values in cellular biology one may note cellular volume and radius, 
time for diffusion from the center to periphery of a cell or mitotic 
time, etc. The calculus formulas given in Table 2 were obtained 
directly from dimensional considerations, noting that dD/dL has 
the dimensions D/L and the grad and div have the dimensionality 
L~'. Study of each particular identity reveals its meaning. Any 
dimensional identity is, of course, convertible into a dimensionless 
form by making a quotient of left and right sides. It is suggested 
that dimensional coding methods proposed by Thun (1960) at the 
IBM Laboratories may be used for classifying biomathematical 
identities and equations. A systematic taxonomy of relationships 
would then be possible. 

To illustrate the use of identities in Table 2 attention is turned 
to the problem of cell radius. It is certainly apparent that some 
‘*similitude’’ of cell size is present among the cells of one or- 
ganism or even in the entire biological animal kingdom, throughout 
which cells generally have a radius in the small size range of 
about 10 to 40 microns. The existing analyses of cell radius offer 
a number of alternative equations for cell size, some of which im- 
plicitly contain dimensional relationships given in Table 1, but 
there appears to be no final conclusion as to which equation is 
really the most pertinent. Because of the complexity of the dif- 
ferential equations involved and the large number of constants 
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contained in them, it has not been possible to put the matter to 
any definitive experimental test, insofar as the writer is aware. 

It is now generally accepted that cells do not maintain any sub- 
stantial hydrostatic pressure, do not have tensed outer walls, ex- 
cept possibly during mitosis, and that their internal viscosity is 
very high (Brachet and Mirsky, 1960). Cells are known to reach an 
equilibrium size shortly after undergoing mitosis and are able to 
perform all of their normal synthetic activities, etc., regardless of 
small variations in shape and size (as shown by mutated giant 
cells); it has also been established that cell volume and nuclear- 
cytoplasmic volume relationships do not determine when a cell 
will undergo mitosis (Mazia, 1960). 

Under these conditions the most likely simple relationship be- 
tween the terms suggested is probably: 


AL/D = 6(C1/C3). (8) 


The first term of this equation may be interpreted as the ratio be- 
tween time for diffusion through the cytoplasm and passage through 
the membrane. The presence of a cytoplasmic volume and discrete 
membrane is certainly an indispensible necessity for a ‘‘cell.’’ 
Presumably the transit times through cytoplasm and membranes 
should not be too disparate. The second term may refer to any two 
substances, probably sodium and potassium ions, but possibly pro- 
teins or nucleic acid components which indirectly determine ionic 
concentrations. The symbol ¢ indicates a functional relationship 
between these two dimensionless entities. The latter is uncertain 
and will have to be studied experimentally. 

It is, of course, conceivable that factors other than the five con- 
tained in equation (8) will be found important for cell similitude. 
For instance, an average synthetic rate for some key substance, 
always produced by the cell, might enter. In this case term /2, 
which relates synthesis rate to concentration, could be included. 
Viscosity and surface tension may perhaps also play a role and are 
included through /g,. The final relationship among the 8 variables 
is then readily expressed by the functional equation: 


ly = p* ssl sa, (C;/C2)). (9) 


Such a step would be completely analogous with what is done in 
hydro and aero-dynamics research routinely and has yielded very 
useful results. In this case the original relationship between 8 
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variables is compressed into a functional relationship between 4 
variables, 

The fact that the variables listed in Table 1 enter into the prob- 
lem of basic cell parameters such as size is accepted as common 
knowledge, since diffusion, permeability, synthesis, etc., are 
physical parameters which are discussed in every text on physical 
chemistry or biochemistry (Bull, 1951). The dimensional equa- 
tions (in the form of dimensionless terms) given above are certainly 
homogeneous and correct by definition (it is to precisely this point 
which Lord Rayleigh refers in the above quotation). The problem 
remains of finding the final form of the equations. Instead of start- 
ing with a set of complex differential equations containing a large 
number of undetermined coefficients, dimensional analysis suggests 
finding empirical and theoretical relationships between the more 
compact dimensionless terms. Some of the relationships present 
in Table 1 are implied in Rashevsky’s work, but were obtained in 
a different manner. 

Dimensional analysis (and inspectional analysis) serves to limit 
and condense variables by restricting their modes of combination. 
In order to validate theories of biomathematics they must be put to 
experimental test. The study of empirical relationships between 
pertinent dimensionless terms should make this task more practical. 


5. Protoplasmic Forces. Before turning to a different topic, some 
mention may be made of the problem of ‘‘diffusion forces.’’ Rashev- 
sky discusses such forces in his analysis of cell size, cell divi- 
sion and cell movement; the existence of these forces is justified 
by equation (25) in Volume II, page 115. 


rad C, (10) 


2 M,, 


where F is the force on a fluid volume (or particle) with the volume 
Vin» M, is the molecular weight of a diffusing substance, and grad 
C is the mass concentration gradient of the same substance in a 
given direction. This gives ‘“‘grad C’’ the dimensions M/L*. Not- 
ing that (R @) is energy per mole at a given temperature and that M, 
is mass per mole, the above equation is found to be dimensionally 
homogeneous. It thus suggests that a diffusing substance generates 
a force acting on a fluid volume or a particle suspended in solution. 
However, some difficulties arise in the experimental demonstration 
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and further theoretical analysis of diffusion forces. One may note 
that if a force is present in a fluid and it acts on a particle of finite 
size, a pressure must be generated from the relationship F/A=P. 
There would then be a pocket of pressure on the high-concentration 
side of the particle; yet this pressure would be propagated at the 
velocity of sound in the medium, which far exceeds the diffusion 
velocity, and thus promptly be dissipated. The situation is worse, 
not better, in a gelatinous medium, where hydrostatic pressure 
would still be transmitted quickly while diffusion was very much 
slowed down. 

This problem is an especially confusing one and an outstanding 
example of a situation in which dimensional analysis may be of 
help. The concentration term in equation (3) is assumed to have 
the dimensions of M/V, rather than Cy, moles per volume ¢ Vie 
which formally makes grad C identical with grad p. A force would 
indeed be generated by the movement of a region of altered density 
through a fluid. However, in the case of diffusing molecules typi- 
cally encountered in cytoplasm, the density change is very small; 
in addition, many substances are being formed and diffuse simulta- 
neously. The limiting case of the latter phenomenon is the diffusion 
of white balls into a field of black balls, of identical mass, in 
which case, clearly, there is no force in the direction of the black 
balls. 

It thus appears that ‘‘diffusion forces’’ play a role only under 
special circumstances and that some other explanation is needed 
for cellular phenomena such as protoplasmic streaming. The neces- 
sity for explanations other than diffusion forces has also been 
noted by Rashevsky in Chapter 8 of the first volume of his two- 
volume work (1960). 

It is readily shown experimentally with any type of inert particles 
suspended in a drop of water that introduction of a crystal or mix- 
ing with an adjacent droplet of high-concentration solution produces 
vigorous fluid flow, which can carry along particles. It is well 
known that water will flow into a region of elevated molecular con- 
centration, thus diluting it; this is apparently what takes place in 
the experimental phenomenon noted. The mathematical formulation 
of such streaming, in the presence of variable cytoplasmic vis- 
cosities, osmotic pressure gradients generated by particles syn- 
thesizing substances at variable rates, with the participation of a 
large number of materials with various diffusivities, etc., is an 
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exceedingly difficult matter. Some of the dimensionless terms 
given in Table 2 are directly applicable to this problem and may 
be expected to be of considerable practical value in the further 
analysis of protoplasmic streamings. 

As seen in this example, dimensional considerations may be of 
the greatest importance in derivation of equations for mathematical 
biology. The use of concentrations carefully identified as to sub- 
stance and molar or molal representation can prevent many serious 
conceptual difficulties. 


6. Biomass Proliferation. Another example of a general biologi- 
cal problem to which dimensional analysis may be applied profitably 
is that of basic growth or biomass proliferation. Among the many 
works on this matter some of the most definitive ones are by Weiss 
and Kavanau (1957) and Landahl (1959). The general case of bio- 
mass proliferation can be analyzed in terms of the dimensional vari- 
ables given in Table 3. Several things in this table deserve com- 
ment. The subscripts zt, j,...n refer to different substances; a 
separate dimensional measurement axis is legitimate whenever the 
substance can be uniquely separated from the total biomass present 


TABLE 3 


VARIABLES RELATING TO BIOMASS 


Variable Dimensionality Interpretation 


1.4; Quantity of biomass or species; 
change in biomass AM;,j. 

2. dM ;/M; Increase of M; measured relative 
to itself. 

3. M;/M; Ratios of biomass; also 6M;,/6M;. 

4, dM;/dT Proliferation rate of M;; also loss 
of M; with time. 

Me Self-interaction of mass M, as in 

te production of waste in limited 

volume. 

6. M;M; Heterogeneous interactions; as with 


symbiosis, predation, etc. 


7, (dM;/dT)T 


Time specific self-increase rate of 


M; M;. 
L 
8. (M;/T) Relates rate and acceleration of 
(M ,/T?)(T) growth to ‘‘age.”’ 
J 


*Can be expressed as M;/M; if M;, M; are clearly distinguishable. 
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and/or other substances. Otherwise the generic variable M must 
be used. The variables given above can be combined further, e.g., 
ratios of terms such as No. 4, No. 7, etc. The time in term No. 7 
is what might be called a ‘‘system time.’’ The variable ‘‘T’’ thus 
has a specific numerical value which increases monotonically with 
passage of time and enters explicitly into the dimensionless vari- 
able No. 7. It is of interest that time enters in a similar way in 
systems reaching a heat equilibrium. In other cases time may en- 
ter as a time constant with dimensions 7 '. 

The items listed in Table 3 might be considered as directly 
derivable from dimensional analysis of the simple variables M and 
T. Are its prediction as to what terms will turn out to be useful 
borne out by actual experience? To evaluate this point one may 
cite some of the more important equations and terms which are to 
be found in existing treatments of the subject. In a major work 
Weiss and Kavanau (1957) give the following set of differential 
equations as a final statement of their theory of feedback con- 
trolled growth of a growing biomass Mg and a differentiated biomass 
Mp, the quantity of growth inhibitor substance Q, the dilution 
volume of the latter V, and a number of time constants &; 2 T7'. 
Starred terms are ‘‘final’’ values at equilibrium. Symbols have 
been adapted for purposes of consistency with nomenclature in 
this article. 


dM ¢/dT =Mcko EE “ seo] eee 


(Vis) 
oh fT) 


aMp/dT =koMc¢ +kyMc¢ [1 - Cen |- ea. (12) 


dQ/dT =ks Q* [[f(Mc/Mé); (Mp/MB)] - Q/Q*]. (13) 


In the text, the [(Q/Q*)/(V/V*)] term is replaced in part by (C/C*), 
where C is the inhibitor concentration. Some use is also made of 
terms with the form dM¢/M¢ and dMp/M@. The time constants k; 
may be removed to the left-hand side of the equation, to give terms 
which are identical with No. 7. The (log 2) term was introduced as 
a Special term to provide for mass doubling. It is clear that the 
presented equations contain dimensional relationships in a very 
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basic form. Their appearance in this recent work supports the con- 
cept that an a priori search for dimensional relationships will be 
helpful. 

Another interesting example is found in a discussion by H. D. 
Landahl (1959), An analysis of regeneration of skin or organs fol- 
lowing partial removal of the structure is shown to lead to an equa- 
tion whose dimensional equivalent is: 

St tal 
T (dM /dT) 


In the first approximation the unknown function & may be replaced 
with a simple constant; this gives a reasonably good fit of avail- 
able data. Equation (14) states that the time-specific relative 
growth rate is simply proportional to how much of the structure is 
missing, a very interesting and surprisingly simple conclusion. The 
mechanism of this dependence is presently largely unknown, but if 
the form of the dependence is corroborated in other examples equa- 
tion (14) could be classed as a possible ‘‘law.”’ 

Other authors have suggested still other types of equations for 
mass proliferation. In separate works Schmalhausen (1927), Levy 
(1952) and Osgood (1952), all propose that d(log M)/d(log T) be 
constant during growth. This corresponds with use of No. 7 in 
Table 3; it is not clear that the ‘‘system’’ nature of the time was 
recognized in all cases, however. It may also be pointed out that 
No. 7 or the presented log equation are equivalent to: 


(dM/M) = b(dT/T), (15) 


which is the most general statement of relationships of ‘‘self- 
measured’’ changes to each other, i.e., relationships arising during 
growth of several structures. Osgood (1960) has recently demon- 
strated that this simple equation may be used to predict the growth 
of humans from infancy to old age with remarkable accuracy. The 
same form is applicable to dL;/L;, dV/V, etc. It has also been 
demonstrated that different parameters of the bony skull grow ac- 
cording to a law which is linear on log-log plots (Ford, 1959). 
Work by Gompertz (cited by Weiss and Kavanau, 1957) led to the 
equation d(log M)/dT = log (M*/M)*, in which a time constant was 
presumably implied. Von Bertalanffy (1941) offers the arrangement 
d(log M)/dT = (a/M)'~* — b, where a and 3 are presumed to be time 
constants. Later work by this author makes use of a growth equa- 
tion containing an anabolism term proportional to area dd and a 


= (1 -M/M*). (14) 
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catabolism term proportional to volume; good experimental fits were 
obtained (von Bertalanffy, 1957). Robertson (1923) and Shock and 
Morales (1942) use the equation d(log M)/dT = a(M* -M), which 
is very clearcut dimensionally but uses an arithmetic instead of 
geometric feedback factor. 

One may thus conclude that dimensional analysis of growth pa- 
rameters has a unifying and clarifying effect on the existing treat- 
ments of the subject. By formulating terms such as No. 7, No. 3, 
etc.,of Table 3, the plotting of growth data is much simplified. 
After derivation of equations (11), (12) and (13), Weiss and Kavanau 
(1957) tested their results by introduction of the equations and var- 
ious constants into an analog computer, with good final results. 
It is suggested that analog plotting of dimensionless terms can be 
used for study of other problems, such as analysis of compartment 
models and a series of linked chemical reactions. 


7. Dimensions of Information. Asa final illustration of techniques 
by which dimensional analysis can be applied in biology, brief 
consideration will be given to the dimensional nature of ‘‘informa- 
tion.”’ The basic mathematical form of the information function 
H (#), which was originally suggested by Boltzmann and has been 
thoroughly studied in the recent decade (Brillouin, 1956; Yockey, 
1958; Luce, 1960; Cherry, 1956), is 


H(«) = )° 9; log p; (16) 


in which @ is the random variable for which the average probability 
of discrete values in an extended series of trials is given by p,. 
The simpler function log p specifies the number of ‘‘bits’’ (binary 
choices among equally probable alternatives), which are needed to 
select one particular entity out of an ensemble of entities. The 
theoretical basis of information theory will not be discussed fur- 
ther in this article. It is easy to see that equation (16) is dimen- 
sionless, since it contains only probabilities, which are numerical 
ratios. In order to apply the information function H (x) to any spe- 
cific physical situation it is necessary to introduce a dimensional 
constant. Table 4 includes two of the dimensional constants that 
are used in thermodynamics and then offers some additional ones 
that may be useful in biological problems. 
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The situations covered in Table 4 are best understood by noting 
that when the value of the function H(z) is large there is a large 
‘‘multiplicity’? (log2 7) of alternative states and conversely any 
single state is less likely to be encountered by chance. In the 
case of entropy, it so happens that presence of each state is as- 
sociated with a certain amount of energy, so that many states bind 
a larger amount of energy at a given temperature. In the choice of, 
for example, letters for a word, no such relationship to energy is 
present. Dimensional analysis is thus very helpful in clarifying 
the relationship of entropy to ‘‘information,’’ a matter that has gen- 
erated considerable confusion. What is more, it points the way 
to formulation of many new and interesting functions, such as one 
which mathematically characterizes natural selection efficiency, 
given in item No. 5, and a function which indicates cumulative 
‘*time storage’’ in a given enzyme, which synthesizes biochemi- 
cals previously produced at random, as suggested in item No. 4. 
A future work will be devoted to the further extension of this con- 
cept, and also to the applications of dimensional constants to 
functions which increase with ‘‘certainty’’ rather than ‘‘uncertainty.”’ 


The suggestions of Prof. N. Rashevsky on certain matters re- 
quiring clarification in the text are gratefully acknowledged. 
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A MINIMAL PRINCIPLE IN BIOMECHANICS 
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Expenditure of energy under several simultaneous forms (mechanical, 
chemical, etc.) is associated with all muscular activity. The energy is 
directly related to what is commonly called exertion or effort. This paper 
defines ‘‘muscular effort’’ quantitatively in terms of some of the elements 
of the dynamics of the human (and animal) body. It postulates that in all 
likelihood the individual will, consciously or otherwise, determine his mo- 
tion (or his posture, if at rest) in such a manner as to reduce his total muscu- 
lar effort to a minimum consistent with imposed conditions, or ‘‘constraints.’’ 

The principle, formulated inmathematical terms, is sufficient to ascribe 
to the moments at all body joints—a matter generally of free choice on 
the part of the individual—their most likely magnitudes. It therefore 
renders the equations of human (and animal) motion determinate within 
this context. The paper also describes briefly an iteration method for the 
solution of these equations, once they have been made determinate. A 
simple illustrative application of the principle is included. 


I, Introduction. The moving form has been a subject of abiding 
interest through the ages. Succeeding generations, from cave 
dwellers to civilized society, have expressed movement with pains- 
taking care and variety in painting, sculpture and dance, whatever 
may have been the mystic significance of these forms. During the 
Renaissance the interest became more sophisticated with the be- 
ginning of systematic attempts to study human motion and to describe 
its physical characteristics (Leonardo da Vinci, 1510; Borelli, 1679). 
In modern times the quantitative study of motion has been actively 
pursued on many fronts, among them time-motion problems (Marey, 
1894; Braune and Fischer, 1895-1900; Fick, 1904-1911; Gilbreth, 
1917; Strasser, 1917; Bernstein, 1926; Elftman, 1951), Recently 
the subject has attracted a sudden increase of interest for some 
important reasons. Among these may be mentioned the great popu- 
larity of modern athletics, the urgency of restoring function to am- 
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putees and victims of neuro-muscular diseases, and the new prob- 
lem of human travel in outer space. 

Practically all quantitative investigations of human motion so 
far performed, some of them quite extensive (as, for instance, the 
work of Braune and Fischer), are characterized by the physical re- 
cording of some prescribed motion, followed by a discussion of 
stresses at certain regions in the body (joints, points of support) 
on the basis of the recorded data. There is agreement as to the 
need for an analysis of human motion* in as great a generality as 
possible, based on principles of theoretical mechanics and so con- 
ducted as to provide fundamental understanding and to predict pat- 
terns and significant characteristics. No such analysis exists. 

This lack may be due not only to the complexity of the problem 
but also to the fact that the equations of theoretical mechanics are 
by themselves incapable of determining completely the unknowns 
in human motion, a matter largely of free choice on the part of the 
individual. This study concerns an effort to seek supplemental in- 
formation in certain speculative areas not considered part of theo- 
retical mechanics. 


Il. General. For an analysis of general human motion based on 
theoretical mechanics, the classic equations of dynamics (Newton’s 
laws) are applied to the several segments of the body, which are 
connected to each other at the joints. Written in their simplest 
form, this system of equations contains the following primary time- 
variable unknowns, in terms of which all other unknowns can be 
expressed: (a) the coordinates of some definite point in the body, 
relative to some fixed axes, to which all other points may be re- 
ferred; (b) the orientations of all body segments, relative to the 
fixed axes; (c) the moments at the ends of the segments at the 
joints; these are defined as the products of muscle tension by its 
lever arm at the joint, and will be referred to as joint moments; if 
several muscles act together upon the same end of the segment, 
their resultant moment will be taken as the joint moment at that 
end;t (d) reactions at all support points, but one. The following 


*It is understood in this paper that statements about ‘‘human motion”? 
may also refer to ‘‘animal motion.’’ In addition, the term ‘*‘motion”’ is 
understood to comprise ‘‘equilibrium’’ as a particular case (vanishing ac- 
celeration terms), 

tWhere muscles span over a body segment (two-joint muscles), their 


effect can be represented by appropriate moments located at the ends of 
the segment. 
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quantities are assumed given: (e) applied forces (generally weights); 
(f) physical characteristics of the segments (length, mass, center 
of gravity, moments of inertia); (g) initial values of unknowns 
listed above under (a) and (b) and of their first derivatives. (The 
fact that during motion the body segments do not remain absolutely 
constant in shape, nor the joints invariable in their geometry, may 
be disregarded in a first analysis; these effects may be taken into 
account at the cost of increased computations if refinements are 
necessary.) 

Unlike mechanical systems (for instance, an articulated system 
made up of rigid bars, links, and springs, which offers some struc- 
tural similarity to the body of vertebrates), the motion of the human 
body in response to a given set of influences (applied forces, ini- 
tial positions, and velocities) may be highly varied. This is a 
simple result of the fact that the unknown moments at the joints, 
capable of being generated at will by the normal individual through 
control of his muscles, are not determinable by the dynamics equa- 
tions containing them. From the viewpoint of mathematics, the 
equations of motion are indeterminate, that is, their number is ex- 
ceeded by the number of primary unknowns. The excess is pre- 
cisely the number of the joint moments. 

These moments, although voluntarily induced, are not abitrary. 
They must obey, generally in combination with some of the other 
unknowns, certain over-all conditions imposed upon the motion as 
a whole. Simple examples of such conditions are the maintenance 
of some prescribed velocity (constant or time-variable), of some 
given step length, etc. (Fixed patterns of movement, derived from 
culture, habit, etc., are considered to be other conditions of this 
nature.) These constraints, to use the term customary in dynamics, 
reduce the number of excess unknowns by their own number. Calling 
m the number of the joint moments and n that of the given constraints, 
the degree of freedom of the body is m — n; that is to say, it is pos- 
sible to ascribe values at will to m —n of the primary unknowns, 
and still satisfy the laws of mechanics and the constraints. 


Il. A Minimal Principle. At this point no further relations ap- 
pear available for the removal of the (m — n)-fold indetermination in 
the mathematics of human motion. Yet, little trace of uncertainty 
is apparent in the movements of the individual. Rather, his pat- 
tern of motion is generally precise at each instant, as though it 
were in obedience to some strong inner rule of conduct. This rule 
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seems concerned with the reduction of exertion to a minimum at all 
times, consistent with the task assigned. When the individual 
takes the shortest path between two points, or when he adjusts 
his stance according to some particular rhythmic pattern as he swings 
a hammer, he is instinctively obeying some such rule. This need to 
economize physical activity has already been recognized, but the 
assumption has been that it concerned mechanical energy and mus- 
cular work (Lehman, 1912; Lysinski, 1925; Contini, 1954; Cotes and 
Meade, 1960). Such an interpretation on the basis of concepts almost 
always connected with physical displacement seems unmindful of 
the fact that this concern for economy operates during rest also. 
An example of this can be seen as the individual quickly adopts 
one definite posture, out of many which are all equilibrium posi- 
tions, when asked to stand still with, for instance, one leg stretched 
out a given distance. That which the individual strives to minimize 
must therefore be an effect present during rest as well as during 
motion. 

Such an effect is muscular effort. We submit that this is the sub- 
ject of the individual’s concern, in view of its connotation of ex- 
ertion, discomfort, fatigue, pain, etc., and of its close association 
with mechanical as well as chemical energy and metabolism. The 
exact nature of the association is unimportant for the purpose of 
this study. What is important is to give the term muscular effort a 
concrete definition which is reasonable, appropriate for quantita- 
tive discussions, and likely to lead to useful results. 

We propose to define muscular effort in terms of the tension in a 
muscle and of its duration, or, reasonably enough, in terms of their 
product, as is commonly done in similar cases. Since a moment at 
a joint is proportional to muscle tension (tension x lever arm), this 
would define muscular effort as a function of the product of a joint 
moment with its duration. The simplest expression is c-M-At, in 
which c represents a numerical factor, M the moment at the joint, 
and A¢ the duration (time interval), An objection to this expression 
is that it leads to positive and negative muscular efforts, corre- 
sponding to the signs of joint moments; muscular efforts of opposite 
signs are not only meaningless, but would also cancel mutually in 
a summation of them, These difficulties are avoided by adopting 
the next simplest form, namely, c-M?-Az, as a measure of muscular 
effort at a joint. This expression has other advantages from the 
viewpoint of mathematical operations and will be used here, sub- 
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ject to possible revision on the basis of test indications, (A dis- 
cussion of factor c and time interval A¢ is given later.) 

On the basis of the preceding considerations, a minimal principle 
may be postulated as follows: A mentally normal individual will, in 
all likelihood, move (or adjust his posture) in such a way as to re- 
duce his total muscular effort to a minimum, consistent with the 
constraints. 

This principle is presented as a working hypothesis, however 
plausible it may appear to be. Like any other hypothesis, it can 
only be confirmed by agreement between experimental results and 
deductions based upon it. A simple test is suggested at the end 
of the paper as a preliminary step. One may also note that inde- 
termination of the kind discussed here is frequently met in the 
theory of structures under the heading of statically indeterminate 
systems, an area remote from biomechanics. There the problem is 
made determinate by the statement that the structure adjusts itself 
in such a manner as to reduce its internal work of deformation to a 
minimum.§ This is the basis of the method of least work in en- 
gineering. Minimal principles are also set up in other branches of 
science, and give rise to the methods of least action, least con- 
straint, least squares, etc. In all these fields, the principle has 
proved itself powerful and flexible. These examples, although un- 
related to the situation faced here, nevertheless allow some op- 
timism about the possibilities of the mode of approach suggested. 

For the purpose of a mathematical formulation of the preceding 
statement, represent by the symbol E the sum of the muscular ef- 
forts (c-M?-At) at all the body joints, plus some initial constant 
Ao; thus, 

E =(ciM? + coMd +...4+0,M,7)At + Ao, (1) 


in which the common time interval A¢ has been factored out; sub- 
scripts 1 to m denote the several joint moments, and c’s are nu- 
merical factors. The expression E will be referred to as effort 
function. 

Now assume that the constraints, n in number, are represented in 
the form of as many equations. For simplicity, these equations are 
written here in terms only of the joint moments and the variable 
time. (The case in which they also contain the orientations of 


§It may be noted, incidentally, that this work is a summation involving 
the squares of redundant moments or forces in the structure. 
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body segments is referred to subsequently.) The constraints are 
thus assumed to be represented by relations of the following gen- 
eral type: 

ia (M1, Mo, eeey M m,t) = 0, 

fo (M1, Ma, eeey M nt) = 0, 


ooo eee eee eevee eee eevee 


fn (M1, Ma, <0«5 M,,,t) = 9. 


(2) 


The minimal principle postulated above can now be restated in 
mathematical terms: The most likely motion of the individual is one 
in which the joint moments M1, Mo,..., M, minimize the effort 
function E (or its time integral, depending on the problem) and at 
the same time satisfy equations (2). 

A few words about the term Ao, factors c, and time interval Az: 

The constant term Ao drops out of the operations of minimization and 
need not be specified. So does time interval Az if function EF itself 
is to be minimized; in case minimization concerns the time integral 
of E between two given time limits, A¢ acts as the integration varia- 
ble. Factors c may be thought of as weights reflecting the relative 
‘sensitivity’? of the joints. They will operate in the direction of 
‘‘unloading’’ weak, painful, disabled joints in the same sense as 
the individual ‘‘favors’’ them. For normal individuals, operating 
under normal conditions, they may be taken as equal and will drop 
out. For others, c values should be assigned to the joints by trial, 
in relation to their relative sensitivity. 

Formulation of the minimal principle in terms of only the ‘‘vol- 
untary’’ part of the joint moments has been considered in this study. 
This is defined as that part of the moment which would remain after 
removal of what may be a component produced by the passive ex- 
tension of antagonistic muscles during rotation of the joint. Under 
such a formulation, this component is excluded from the expression 
of the effort function, and appears instead in the dynamics equa- 
tions in the form of a representative function of the rotation. Be- 
cause such an effect—if it exists—appears to be too small for 
ready detection, this alternative formulation has not been pursued 
at this time. 


‘ 


IV. Mathematical Discussion. General mathematical methods ex- 
ist for writing the necessary conditions of minimization of the ef- 
fort function EL, satisfying at the same time all existing constraints. 
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In certain cases, instead of function F£, its time integral between 
two time limits may be involved. The operations furnish m — n new 
equations, precisely the number required to render the problem of 
human motion determinate. 

The procedure is outlined below, limited for simplicity to cases 
where the function F itself is concerned (for details, see Courant, 
1936).+ 

From the viewpoint of the form of the constraints, two cases may 
be distinguished: (a) the constraints involve the angles of body 
segments, joint moments and the time variable; (b) they contain 
only the joint moments and time, that is, they are of the type of 
equations (2). As it is often possible to convert case (a) to case 
(b), the discussion will be restricted to this latter to avoid mathe- 
matical involvements which are not of great interest here. 

It is therefore proposed to minimize effort function E (or, more 
correctly, to write that E is stationary), subject to constraints 
represented by equations (2), 

This requires setting the differential dE of function E equal to 
zero; thus, treating A¢ as a constant, 


C1M,dM, + coModMz +...+CmM,dM, = 9. (3) 
The differential elements dM;, dMo,..., dM,, are subject to the 


following conditions, obtained by differentiating equations (2), in 
which ¢ is treated as a constant, 


aM, aM; r 
Ey RR ayes hE agp int9 (4) 
OM, OM OM, 


owe 6 8 6 6 8 86 CS SDS 6 06'S OF 6D 0 8 6.4 € 0 86 ee 8 SF 


Using the method of Lagrange’s indeterminate multipliers, mul- 
tiply equations (4) by Ai, A2,--+An respectively, add to equation 
(3) and equate the coefficients of dM;, dMo,..., dM,, to zero; thus, 


tCases where a time integral of Z is to be minimized may be handled 
by methods of the calculus of variations (see, for instance, Margenau and 
Murphy, 1943). 
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fi ofa of n 
My +4 ——— + Ag —— rn =0 
1 SEP a aM, OM; ’ 
af: af af 
My + A; —— atte Hee ee 5 
Bp ae CASI Mo + Xo aMz 4% + An aM > ’ (5) 
Mn +1 hi ofa An fn = 0. 


Elimination of the n multipliers from the m equations (5) gives 
m ~n relations in the unknowns M and the variable ¢. According to 
the discussion in the text, this is the number of relations neces- 
sary to remove the indetermination of the equations of motion. 

A method of iteration is described briefly below for the solution 
of these equations, once they have been made determinate. 

The equations of motion assume their simplest form when written 
as moment equations applied to each of the segments of the body. 
They are linear differential relations of the second order in the 
trigonometric functions of the angles of the segments, or in these 
angles themselves if they are not too large. To simplify the dis- 
cussion it will be assumed that the unknowns listed under (a) and 
(d) on page 378 do not appear in them, having been eliminated be- 
forehand. Denoting the angles by the symbol & and their number 
by p, equations of motion of this type, also in number 7p, can be 
written in the following general forms (double dots’ indicating 
second derivatives with respect to time), 


Fy (lyeenes Xp; ieeens ae: tee Mn) = 9, 
Pa (Qisc~ss Opt A) yn ane no Mie eee 


gente ck Cie Ss m ahaa a'ia'e © aie ee ark ek oat eer (6) 


(In some cases, first derivatives of the angles, representing viscous 
damping, may also be present; the method applies equally to such 
cases), 

The m unknowns ™ are found as functions of ¢ from the m-—n 
equations referred to above, plus the n constraints of equations (2). 
They are substituted into equations (6) which then take the form 


ECT Pee Bre: eter en OW Nr 


«Sis ocak 3 ER ee es (7) 
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As a first approximation, neglect the second derivatives (as- 
sumed to be not too large) in equations (7), and solve for angles a 
in terms of ¢ For a second approximation, form the second de- 
rivatives of the angles just found, introduce into (7) and once more 
solve for the angles a. 

This process can be repeated as many times as desired. The 
angles will be found as a series of terms and will converge the 
more rapidly as the angular accelerations of the body segments are 
the smaller. 

The solutions found above for moments M are complete. For the 
complete solutions of angles «, however, add to the expressions 
found above the general solution of the homogeneous dynamics 
equations obtained by cancelling all M terms in system (6). This 
general solution can be obtained by the classic methods of the 
theory of linear differential equations and contains two arbitrary 
constants for each angle «. These constants can be determined if 
the initial values of the angles and of their first derivatives are 
known. 

The iteration method of solution described above is appropriate 
for operations by high speed computers. 


\. Illustration of the Minimal Principle. To illustrate by means 
of a simple example the operation of the principle of minimum effort 
in removing the indetermination of the equations of dynamics, the 
two-dimensional motion of a rough representation of the human fig- 
ure, made up of five segments only, is analyzed here briefly. The 
configuration of minimum effort is determined for the particular 
case of equilibrium in relation to a simple type of constraint. A 
mumerical application is included in which several positions of 
equilibrium are represented for comparison, only one of them con- 
forming to the minimal principle. 

The system is diagrammed in Figure 1. The physical character- 
istics of the segments (length /;; mass m;; moment of inertia about 
the center of gravity /:; moment of inertia about one end /; 4) are 
listed on the figure. The figure also shows the centers of gravity 
(c.g.) located with respect to a joint by means of the distances 
K 111, Kale, etc., where K’s are numerical factors. 

The system is supported at one point A, assumed given; the pri- 
mary unknowns are those listed under (b) and (c), page 378, that is, 
5 orientations 41, 42, X3, 44, 45 with respect to the vertical, and 
4 independent joint moments Mig, Mop, Map, Msp- (These 9 un- 
knowns are underlined in Figure 1.) 
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(X4p > Yap»M ap) 


(X38 ,Y38, Msp) 
(Xig,Yie, Mis) 


FIGURE 1. Schematic diagram. (Side view) Characteristics of seg- 
ments: Segment (1), (11,1, /1,/14); Segment (2), (/2,m2, /2,/2B); Segment 
(3), (13,m3, 73,/3B); Segment Cy (l4,ma, T4,l4p)3 Segment (5), (t5,m5, 
15,/5p)- 


The equations of motion are written in the form of five moment 
equations, one for each segment, each containing one or more of 
the 9 unknowns. The excess 9 — 5 =4 is the number of joint mo- 
ments, as expected from the general discussion. This is verified 
by comparing the number of all unknowns with that of all equations 
available (moment as well as force equations); these numbers, 
found to be 25 and 21 respectively, also differ by the number of 
joint moments (the 25 unknowns are shown in Figure 1). 

The simplest forms for the moment equations are found by writing 
that each body segment is in equilibrium under the effect of inertial 
forces and inertial moments, plus the forces and moments applied 
at its ends as reactions (d’Alembert’s principle). Assuming that 
the angles are small, these equations are the following: 


HOY = Gai42 — G13 + Gar hy + Geias + Ara, + Mp; 


pds = Gah) — Aghe + Mop; 
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HaX3 =-Ggity + Gagty + Gsgth + 4303 — Mig — Mop — Map - Msp; 
Inne = Garay + Gaga3 — Agcy + Map; (8) 
Isn&s = Gsit) + G@sgh3 —Asd5 + Msp. 
The symbols used in these equations are defined as follows: 
Goi = mM2K glali; Gai =(m3Kk 3 + mg + m5) 1311; Gar = mak glgly; 
Gs5i = m5K5l5li; Gag = Mm4K lglg; G53 = m5K 5l5l3; 
Hy = (mg +m3 +mg+ms)li +ha; Hs = (my +ms)l3 + Ip; (9) 
A, =(Kim,; + mo +m3+mg+ms)lig; Ao =moKolag; 
As =(Kamg + mq + ms5)139; Ag = m4K glag; As = m5K 5lsg. 


As an example of constraint, it will be assumed that the length 
of step is specified at each instant; that is, calling s(t) a given 
function of time, we have the condition (see Figure 1): 


—114; + loo = s(t). (10) 


Using the notation of the text, m = 4, n = 1; therefore the motion 
is 4-1 =3-fold indeterminate. The minimum effort principle is to 
furnish the 3 additional relations required. 

Because of space restrictions, the discussion will be simplified 
by confining it to the static case. This case is defined by dropping 
all second derivative terms in equations (8) and replacing the func- 
tion s(¢) in equation (10) by a constant so. For the static case, 
the angles need no longer be assumed to be small and will be re- 
placed by their sines. The equations are then the following, still a 
3-fold indeterminate system: 


A;,sind&, + Myg = 0; —Ae sind + Mog = 0; 
As sin &3 — Mig — Mog - Map - Msp = 9; (11) 

—A,zsinds + Msp = 0; —As sinds + Msp = 0; 
—1;sin&; +lesin&e =S8o. (12) 


Replace sin 4%; and sin &g in equation (12), 


ly lo 
Se ih — Mop = 8 We 
re 1B + aa 2B 0. (13) 


Minimize E (equation 1), subject to equation (13). Apply equa- 
tion (3) (all c’s assumed equal are removed) and equations (4), 
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Mip dMip + Mop dMog + Map GMap + Msp GMsp = 95 (14) 
ly lo 
—— dMig + — dMog = 0. (15) 
A; 1B Ay 2B 


In this simple case, the 3 additional equations needed can be 
found directly without the use of indeterminate multipliers; they 
are: 


MiB ph Mop is Map he Msp (16) 
ay 4 8 A aie 
A, Ag 
These equations, together with equation (13), give: 
l lo 8 
Mig =~ —93 Map => —93 Map = 05 Map = 05 (17) 
19 a9 
2 
Mip + Mi + Mib + Mop = 3; (18) 


in which q? denotes the quantity (J;/A1)? + (J2/A2)?. 
The angles can now be determined, using equations (11) and (17), 


F ly So . lo So 
Ssinvay = ——— —-: Sin &a)— — : 
- 1 ly lo So 
ts =—— (—— +—=] —33 a4 = 0; a5 = 0; 
aia: As (a bi 2) q? - z 


These angles define the minimum effort configuration of the 
static system, under constraint specified by equation (12). It is 
one of many equilibrium positions satisfying the constraint. The 
joint moments corresponding to this configuration are given by 
equation (17). Equation (18) furnishes a measure of the total mini- 
mum effort involved. 


VI. Numerical Application. If the system just discussed (Figure 
1) is made to correspond to the framework of an individual 5 ft. 9 in. 
in height, weighing 160 lbs., the several segments would have the 
following approximate characteristics (according to O. Fischer): 


m1g = mg = 30 lbs; mag = 80 lbs; mag = msg = 10 lbs 
1; =lg = 87 in; 73 = 18 in. (to shoulders); 74 =75 = 30 in, 
Ky =Ko =K, =K, = 0.50; Ks = 0.55. 
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Then: 
A, = 5870 in.-lbs; Ag = 555 in.-lbs; 43 = 1150 in.-lbs; 
A, =A; =150 in,-lbs 


l 
oH '6.9%105 Ib-?, 22 6601 < 100 lbh <7 ee 4.5% 107° Ib™?, 
A; Ag 

Taking so = 20 in., we have so/q? = 4.45 x 10° in.-lb?. The fol- 


lowing numerical values, given by equations (17) and (19), define 
the configuration of minimum effort, under the given constraint: 


Mig = 31 in.-lbs; Mog = 297 in.-lbs; Map = Msp = 0; 
sin 4; =— 0.006, (a; =—0° 21’); sin a2 = + 0.535, 
(A. = + 82° 21’); 
sin 3 = + 0.286, (43 = + 16° 37’); d4 = 45 = 0. 


The sum of the squares of the moments, a measure of the total 
effort involved, is given by equation (18) as: 


Mix + Msg + MZ + MZ = 89,000 (in.-lbs)? 


This equilibrium position of minimum effort is shown in Figure 
2, together with the angles of the segments, the joint moments and 
the sum of their squares. For comparison, two other arbitrary po- 
sitions, also in equilibrium and under the same constraint, but not 
governed by the minimal principle, are shown in Figure 3 with the 


+16°,37' 


Forward 


FIGURE 2. (Side view) Equilibrium position. Minimum effort. (Mo- 
ments in inch-pounds.) 
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onal 


| 
> M* 152,000 
| 


FIGURE 38. (Side view) Equilibrium positions. Varying magnitudes of 
effort. (Moments in inch-pounds.) 


associated angles, moments and sum of squares. The differences 
between the corresponding amounts of total effort, as represented 
by the sums of squares of moments, are to be noted. 

Simple, preliminary tests of the principle of minimum effort could 
similarly be attempted on the basis of a comparison between several 
configurations of equilibrium. Unwarned subjects could be re- 
quested to assume their ‘‘natural’’ posture of equilibrium under the 
same given constraint. One could then ascertain whether or not a 
significant majority chooses the position corresponding to the 
minimum effort. Tests of this character may cover a wide range of 
postures, and could be extended to simple types of motion. 


This study was conducted under a grant from the Research Di- 
vision, College of Engineering, New York University. 
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It is pointed out that two fundamentally different views of primary 
genetic processes occur in the literature which are frequently confused. 
The first is a true communication-theoretic view, which regards the 
genetic apparatus as containing a real information-source and a trans- 
ducer which converts that information to useful form. The second view 
is generally expressed as a template scheme based on the Watson-Crick 
model; it is shown that in this model there is actually no such thing as 
genetic information in a communication-theoretic sense. Both views are 
then discussed on the basis of microphysical principles developed in 
previous work of the author (Bull. Math. Biophysics, 22, 227-255, 1960) 
in an attempt to find which approach is in closer accord with the biologi- 
cal facts. It is shown that, if the communication-theoretic view is cor- 
rect, then the information-bearing object must act as a ‘‘catalyst,’’ but 
it is pointed out that the type of catalysis involved must be of a funda- 
mentally different nature than that occurring in familiar enzyme-catalyzed 
reactions. On the basis of general considerations of irreversible changes 
in microphysical measuring systems, it is shown that any type of template 
must suffer a gradual and irreversible denaturation, which seems to make 
it unlikely that a template could play a primary role in fundamental ge- 
netic processes. 


In a previous work (Rosen, 1960) we have briefly explored a 
possible quantum-theoretic approach to primary genetic mecha- 
nisms. This approach was based on the assumption that primary 
genetic information was carried by a family of states of a particular 

*This research was supported by the United States Air Force through 
the Air Force Office of Scientific Research of the Air Research and De- 
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microphysical system ©, and was specified by the value assumed 
on each of these states by a specific observable A&G. This ap- 
proach requires, as we pointed out, that there must exist in each 
cell a suitable microphysical observing apparatus, capable of de- 
termining the A-values of the states of © contained in the genome 
of the cell. It was remarked at that time that the macromolecules 
currently envisioned as being involved in primary genetic proc- 
esses (DNA, RNA) could be regarded as comprising a portion of 
this observing apparatus in a way that was entirely consistent 
with experiment. 

This approach described above was implicitly based on an anal- 
ogy between the genetic process and the operation of communica- 
tion systems as understood in other contexts. That is, it was in- 
herently supposed that there existed a repository of information, 
and a transducer (the observing apparatus) which converted this 
information into a useful form. As we pointed out in loc. cit., the 
only means by which a microphysical system can convey informa- 
tion is by means of the values assumed on the states of that sys- 
tem by appropriate observables, so that this analysis leads im- 
mediately to the formulation studied therein. 

A cursory inspection of the literature of theoretical genetics 
might lead us to suppose that the term ‘‘primary genetic informa- 
tion’’ as used in that literature is precisely what we called ‘‘in- 
formation”’’ in our microphysical approach. However, an attempt to 
relate this approach to the specific mechanisms suggested by the 
Watson-Crick model for the performance of primary genetic activities 
reveals that this is not the case. Indeed, the term ‘‘information’’ 
as used in connection with genetic models based on the Watson- 
Crick structure bears no relation whatever to the same term as we 
have employed it in doc. cit., or as it is employed in communication 
theory. As a result, there exists a real syntactical confusion in 
the genetic literature. 

The situation is basically as follows: the Watson-Crick model 
has rendered it plausible that, if DNA or RNA is indeed, in some 
sense, the primary genetic substance, then the ‘‘information’’ car- 
ried by these macromolecules is transmitted by means of some sort 
of template mechanism, where the template is determined by the 
particular configuration of nucleotide pairs along the molecule. 
It shall become clear as we proceed that, in microphysical terms, 
the template models of genetic activity amount to asserting that 
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these macromolecules do indeed function as microphysical observ- 
ing systems; however, instead of observing ‘‘primary genetic in- 
formation’’ as was supposed in loc. cit., they actually observe ap- 
propriate constituent components of the primary gene products. In 
communication-theoretic terms, the nearest approach that can be 
made to the description of the behavior of these macromolecules 
seems to be that the role of information-source and transducer are 
inextricably combined. In fact, in a strict sense, it will be seen 
that as far as the template models of primary genetic activity are 
concerned, there is no such thing as genetic information; the gen- 
ome of a cell must be visualized, under these models, as consist- 
ing of a family of microphysical measuring systems of certain 
specific types, the properties of which we shall investigate in 
greater detail in the following discussion. 

It may be of interest to compare the view of the role of ‘‘infor- 
mation’’ in the template model of genic action which has been 
described above to that developed by Rashevsky (1960) for genic 
replication. Rashevsky points out that, since the behavior of such 
templates is inherently deterministic, any attempt to assign an 
‘*information content’’ to these templates must yield an information 
content of zero. Our own result is even stronger than this, in the 
following sense: given any structure involving the juxtaposition of 
a large number of different kinds of elementary objects (such as 
nucleotide pairs) it is always possible to use combinatorics in 
various ways to define an ‘‘information content’’ for the structure. 
Rashevsky has shown that the template theory requires that any 
such ‘‘information content’’ of a DNA molecule must necessarily 
be zero. However, all these ‘‘information contents’’ are defined 
purely formally, and there is therefore no a priori guarantee that 
any of them corresponds to the communication-theoretic procedure 
which actually takes place in real genetic mechanisms, What the 
microphysical analysis reveals is in effect that none of the com- 
binatoric means of defining an ‘‘information content’’ bears any 
relation to the actual ‘‘information transfer’’ in the template theory. 
In other words, Rashevsky has shown that if communication-theo- 
retic arguments could be applied in any way to a deterministic 
template model of genic activity, we would find that the templates 
carried zero information; we have shown that there is actually no 
way to apply communication-theoretic arguments to the template 
models in a way consonant with their actual physical activity. 
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Let us now return to the main line of the argument. We have 
seen that there are two conflicting views implicit in the literature 
as to the microphysical structure of primary genetic systems. On 
the one hand, we have the view of those who regard primary genetic 
activity as being in one sense or another a genuine communication 
system, with an information source and a transducer (observing 
system) to convert the information into a useful form. As opposed 
to this view, we have the models based on the Watson-Crick struc- 
ture, which envision no information source, but merely an ensemble 
of observing systems which in effect scan the environment for 
suitable constituents to be organized into ‘‘primary gene products.”’ 
The two theories which result are not equivalent, and it becomes 
important to investigate which of them is in closer accord with 
biological realities. It is the purpose of the present note to briefly 
explore some of these matters. 

Let us first observe that a central role is played in both theories 
by the notion of a microphysical observing or measuring system. It 
is natural to suppose that, whichever picture of genetic activity is 
correct, the various processes involved will be of a biochemical 
nature. It may therefore be best to begin our investigation by con- 
sidering the efficiency of chemical systems in carrying out micro- 
physical observations. By an observation, we shall mean the de- 
termination of the value of a microphysical observable on some 
appropriate state of a given microphysical system, in such a man- 
ner as to leave unaltered the value which is determined by the 
observation. A process which determines such a value but which 
may alter that value will be referred to as a measurement. Thus, 
we may say that an observation is a measurement which is in- 
herently repeatable. 

Let us note that the requirement that the microphysical imple- 
mentation of genetic mechanisms be carried out by observing sys- 
tems and not mere measuring systems is quite essential to what 
we may call the ‘‘communication’’ theory, but is not at all neces- 
sary to the template schemes. This is because the biological facts 
require that no change in the genetic information may be made by 
the act of determining that information; otherwise, repeated meas- 
urements will give the cell different instructions and chaos would 
be the result. On the other hand, since the template theory does 
not deal with information in this sense at all, there is nothing that 
the measurement process need preserve a priori. However, we 
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shall see that some of the most essential properties of observing 
systems are also manifested by general measuring systems, and as 
far as these properties are concerned, the distinction is not im- 
portant. Nevertheless, it will be useful to keep the two situations 
separate throughout the following discussion. 

It is not hard to see that any arbitrary chemical reaction can be 
considered as a microphysical measuring system, in the following 
manner. Let us consider the general reaction 


A; +Ao+...+A, — products. (1) 


We shall interpret the A; in (1) in a somewhat broader manner than 
customary; we shall allow each A; to represent not only a molecu- 
lar species (or rather the ensemble of observables which character- 
izes the species in question) but also an appropriate subfamily of 
such an ensemble; more generally, A; will also be allowed to des- 
ignate such an object as a light quantum or any other physical 
agent which participates in the reaction (1). 
In quantum-theoretic terms, the reaction schematized in (1) 
means the following: there exist n microphysical systems S81, Sa, 
, S, such that, if A; represents an appropriate state of the sys- 
tem S;, the state A; + Ag +...+A, of the composite system S; ® 
Sg ®... 8S, can undergo a transition to another state, denoted in 
(1) by the word ‘‘products.’? Now we can observe that each state 
A; is specified when the (expectation) values of the observables 
of S; on A; are given. In fact, if all the observables of S$; commute, 
we can employ the argument of loc. cit., p. 232, to infer that the 
knowledge of a single observable of S; on A; will suffice to de- 
termine A;. Thus, we can express the reaction (1) completely in 
quantum-theoretic terms, as a relation between the values assumed 
in certain states by an appropriate family of observables. 
Let us now suppose that we are given a situation in which n - 1 
of the reactants A; are present, and one reactant (A;,, say) is ab- 
sent. It then follows that the reaction (1) allows us to regard the 


state ee he as a measuring apparatus for detecting appropri- 
iti, * 

ate states A;, of the corresponding system S;,- For it is clear that 

only a certain set of the states of S;,, will allow the reaction (1) to 

proceed, and those states are characterized by definite values of 

appropriately defined observables of 8;,. Moreover, we know that, 
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in general, two different states of S;, will lead to formation of 
recognizably different products. In fact, in most cases reactions 
of the type (1) depend entirely on systems S; consisting entirely of 
commuting sets of observables, so that these reactions can always 
be regarded as capable of measuring the value of a single observ- 
able of the system S,,. 

A number of remarks may now be made with reference to the be- 
havior of reactions of the type (1) as microphysical observing sys- 
tems. First, we note that in general the reaction (1) will cause a 
change in the state A;, (or more specifically, in the observable(s) 
specifying that state), and therefore the given reaction generally 
fails to act as a true observing system. Further, we must point 
out that most known reactions of the type (1) do not suffice to dis- 
tinguish between a wide range of values of the observable(s) spec- 
ifying the state A;,. That is, most known reactions will in most 
cases occur for a small range of the values of the relevant ob- 
servables; for all other ranges of possible values, a reaction will 
not take place. We may express this in a suggestive manner as 
follows: the bandwidth of this type of measuring apparatus is gen- 
erally very narrow. 

With regard to the second observation, we may make the follow- 
ing remarks: In the first place, it may be argued that the particular 
reactions of type (1) with a sufficiently wide ‘‘bandwidth’’ to dis- 
tinguish many states A;, may merely not yet have been investi- 
gated, and in any case such reactions may well be indigenous to 
the biological systems in which they occur. It does not seem to 
be possible at present to deny this possibility. Moreover, we may 
further argue that it is possible to widen the ‘‘bandwidth’’ of micro- 
physical measuring systems of this type by considering, instead of 
a single reaction of the type (1), an entire famzly of such reactions, 
each capable of occurring in the presence of a different band of 
values of the appropriate observable(s). 

As an example of these suggestions, let us suppose that we 
desire to determine the various frequencies present in a given 
heterochromatic beam of light. In order to do this, we may attempt 
to utilize a photochemical substance of some type. If we could 
discover such a substance which, upon the absorption of quanta 
of various different frequencies, gives rise thereby to distinguish- 
able reaction products, we could deposit that substance on a 
photographic plate, allow the given beam to impinge on the plate 
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for a suitable time, and then examine the plate by known macro- 
scopic means to determine the various reaction products and hence 
infer the frequency spectrum of the original beam. However, it is 
known that most such substances will react only in the presence 
of a comparatively narrow range of energy value. Hence it is sug- 
gested to try to find an ensemble of photochemical substances, 
each reacting for a different frequency range of incident quanta, 
such that the resulting reaction products are mutually distinguish- 
able. By using a larger and larger ensemble of appropriate sub- 
stances, we may in principle make the bandwidth of such an ap- 
paratus as wide as we like, 

Let us now take up the question of whether it is ever possible 
for a chemical system of type (1) to carry out actual observations 
instead of mere measurements. This point is most important to the 
present discussion, because if we could show that it is not pos- 
sible for a system (1) to carry out a true observation, we would 
thereby in effect prove the impossibility of any communication- 
theoretic picture of the operation of the primary genetic apparatus 
(or at least any such mechanism which can be expressed in bio- 
chemical terms). 

More specifically, we observe that if we regard the reaction (1) 
as operative in a communication-theoretic picture of primary ge- 
netic activity, then the state A,, (or rather the values of the appro- 
priate observables of S,;, on that state) will play the role of the 


n 
‘*bearer of genetic information,’’ while the composite state >: A; 
eign 
must play the role of the observing apparatus. As we have seen 
above, if the state A;, undergoes any kind of transition involving 
the ‘‘genetic’’ observables as a result of the reaction (1), then the 
information-bearer is thereby destroyed. However, we know that 
the genetic information, if it exists, must be transmitted intact for 
an unlimited number of generations. The only way that the reac- 
tion (1) can fulfill this requirement is for it to be of the form 


a i A;+A;,— products + 4;,. (2) 
ree 


That is, the state A;, must again appear as a product of the re- 
action (1). 
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The form of (2) is equivalent to saying that A;, acts as a cata- 
lyst for the reaction. We may then sum up the discussion in the 
following way: a chemical reaction can serve as an observing sys- 
tem only if the observed system plays the role of a catalyst for the 
reaction in question. However, it must not be supposed that this 
result implies that a communication-theoretic view of primary ge- 
netic activity is equivalent to asserting that the gene acts to form 
primary gene products by a simple catalytic process (cf. the dis- 
cussion in Goldschmidt, 1958, p. 267 ff.). In the present situation 
the catalysis in question is occurring in the observing apparatus, 
and serves to change the state of the observing system in some 
manner; the final result of the observation (i.e., the primary gene 
product) may be formed in an entirely different manner. We shall 
not analyze this important point any further here, since we shall 
discuss these matters in detail in a forthcoming publication; we 
merely wish to point out that the result we have obtained above is 
not so simple to interpret as may at first appear. Our main purpose 
here is to show that a communication-theoretic view of genetic 
processes cannot be immediately discarded. 

We shall now show that the general quantum-theoretic laws of 
measurement can be used to derive important properties of measur- 
ing systems of type (1) as they occur in genetic processes. Asa 
result of his exhaustive analysis of the general microphysical 
measuring process, Bohm (1951, Chapter 22) has shown that there 
is a fundamental irreversibility connected with the operation of 
any microphysical measuring system. This irreversibility arises 
from the fundamental laws of quantum mechanics and cannot be 
removed without rendering the quantum-theoretic formalism incon- 
sistent; it is moreover a strictly quantum phenomenon with no 
analog in classical theory. We quote directly from Bohm (p. 609): 
**...in classical theory fundamental variables (such as position or 
momentum of an elementary particle) are regarded as having defi- 
nite values independently of whether the measuring apparatus is 
reversible or not; in quantum theory we find that such a quantity 
can take on a well defined value only when the system is coupled 
indivisibly to a classically describable system undergoing irre- 
versible processes. The very definition of the state of any one 
system at the microscopic level therefore requires that matter in 
the large shall undergo irreversible processes.’’ For fuller de- 
tails, we refer the reader to Bohm’s discussion. 
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Now let us return to the reaction (2). In this Situation, the ob- 


serving apparatus is the composite state » ; A;. By the above 
nigel 

discussion, it follows that the process of observing A;, by means 

of this apparatus must necessarily be attended by irreversible 

processes in the apparatus. In other words, the reaction (2) must 

be zrreversible. 

But this means that the reaction (2) must be of a most peculiar 
type. For in its form the reaction (2) is identical with all the fa- 
miliar enzyme-catalyzed reactions of biochemistry, where the state 
A;, is interpreted now as an appropriate enzyme molecule. But all 
enzyme-catalyzed reactions are in principle completely reversible 
reactions, and there is thus no occurrence of the irreversibility 
which, as we have pointed out, is essential to any microphysical 
measuring process (observation or not). 

In order to understand this apparent discrepancy between the 
two possible interpretations of (2), and at the same time obtain 
insight into the template models of gene action, let us consider 
enzyme-catalyzed reactions in greater detail. It is well known that 
the simple statement (2) does not reflect the actual mechanism of 
the reaction. The existence of enzyme-substrate compounds has 
been postulated in the fundamental enzyme-kinetic studies of 
Michaelis and Menten and many subsequent authors, in the rate- 
theoretic studies of Eyring and others, and finally proved by the 
actual spectroscopic observation of enzyme-substrate intermedi- 
ates. Thus, a typical simple reaction of the type (2) involving an 
enzyme £ which can be written as 


A; + Ag + E=products +E (3) 
may actually occur in the following stages: 
A,+E= AiE (4) 
A,E + Ag=—Ai1E Ag (5) 
A,EAzg=— products + E. (6) 


The important thing to note is that each of these component re- 
actions is now of the form (1), and the discussion of such reactions 
as microphysical measuring systems may now be applied. Thus, in 
reaction (4), it is natural to regard the enzyme E' as comprising a 
measuring system for A;; it is likewise natural to regard the com- 
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plex A;E as a measuring system for Ag. Thus, far from being a 
carrier of information, as might have been assumed from the over- 
all reaction in the form (3), we see that the actual reaction mecha- 
nism requires EF to function as a@ measuring system. 

As we have pointed out, the activity of a measuring or observing 
system is necessarily attended by the occurrence of irreversible 
processes in that system. In the present context, this means that 
the enzyme molecule itself will suffer these irreversible processes; 
that is, in the course of repeated measurements the enzyme mole- 
cule must become denatured, and ultimately cease to function. 
Such denaturation is familiar in enzyme-catalyzed reactions; it is 
usually attributed to impurities in the reaction mixture or to side 
reactions. However, the above analysis reveals that this type of 
denaturation is a consequence of the microphysical nature of the 
reaction, considered as a microphysical measuring apparatus in 
the manner described above. 

It might be pointed out that this type of inherent denaturation is 
highly reminiscent of the inactivation of a Maxwell demon, as 
pointed out by Bohm (op. czt., p. 608n). The resemblance of an 
enzymic catalytic agent to a Maxwell demon has also been dis- 
cussed by Norbert Wiener (1948), 

Let us now note that precisely the same argument as we have 
applied to enzyme-catalyzed reactions can be applied to syntheses 
involving a template. For let us suppose we are given a template 
T, capable of forming a polymer of the form A;A2...A, from ap- 
propriate monomers 4; (some of the A; may of course be the same) 
in the environment. Not all these monomers will arrive simulta- 
neously at the template; suppose for simplicity that each monomer 
except the kth has been bound on the template to form a complex 
which we may denote as TA;A2Q...A,~14441...4,- This com- 
plex will then form a measuring apparatus for A,;, in precisely the 
Same manner as discussed previously. By the same argument we 
used in our treatment of enzyme-catalyzed reactions, we can infer 
that the template 7 must undergo irreversible processes, and will 
ultimately suffer complete denaturation. 

We may now point out that, although the denaturation of individual 
enzyme molecules is of no moment as far as enzyme activity is 
concerned (since enzyme is being continually synthesized), such 
denaturation is completely inadmissible in a template purportedly 
bearing the primary genetic instructions, because such a template 
must be capable of being borne intact through an unlimited number 
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of generations. It may be argued that this denaturation may take 
place so slowly that it may be ignored in comparison with the life- 
time of a cell, but we may note (a) that since this denaturation is 
irreversible, its effects must accumulate in progeny cells, and (b) 
that the replication of cells is far more sensitive to denaturation 
effects (at least those due to high-frequency radiation) than is pro- 
tein synthesis. This last remark implies that, under the template 
theory, cells must lose their capacity for division before they cease 
to metabolize. It may be noted that this is actually the case, but, 
nevertheless, the above considerations concerning denaturation 
make it difficult to maintain the tenability of a pure template theory 
of primary genetic activity. 

Thus, we see that there must be a fundamental difference be- 
tween a reaction of the type (2) which can serve as a microphysi- 
cal observing system in a communication-theoretic picture of ge- 
netic processes and an enzyme-mediated reaction. Most important, 
we see that there can be no complex-formation between the state 
A;, bearing genetic information (which according to the discussion 
above can be regarded as a ‘“‘catalyst’’) and the other A; which 
play the role of the observing system. Only in this manner can we 
insure that the information-bearing state will not be denatured as a 
result of the irreversible processes which must invariably enter 
into the process of observation. This enables us to make a sharp 
distinction between the ‘‘catalysis’’ involved in a communication- 
theoretic view of the gene and the superficially similar catalysis 
familiar in enzyme-mediated reactions. 

The investigation carried out above makes it clear that a close 
examination of the relation obtaining between a microphysical 
measuring system and the system it observes will yield useful in- 
sights into primary genetic activity. As remarked above, we intend 
to return to these matters in greater detail in a forthcoming pub- 
lication. 
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The theory developed in previous papers and based on distribution 
eurves of definite form is generalized to any form of unimodel distribu- 
tions. The time course of the change from one behavior to another is 
discussed and a general theorem about the time course is established. 


We shall study some properties of the basic equation in our 
theory of imitative behavior (Rashevsky 1959; 1960). Familiarity 
with previous work is assumed. The notations are the same as 
used before. 

We have 


SYA SY) ad (1) 
dt 
The quantity X - Y, when plotted against ¥, is represented by a 
curve shown in Figure 1. The exact shape of this curve depends 
on the assumptions made about the distribution function of the ten- 
dency ¢ for either of the two behaviors. However, if ¢ is symmet- 
ric with respect to ¢ = 0, then the curve is symmetric with respect 
to the origin 0. It is monotonically increasing from —-N to +N and 
has one inflection point at y = 0. On the positive branch the curve 
has a monotonically decreasing first derivative; on the negative 
branch the first derivative is monotonically increasing. We shall 
study equation (1) under these most general assumptions. 
Put 


a (2) 
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FIGURE 1 
so that 
X-Y=N(ze-y). (3) 
Furthermore put 
AN =Ao, (4) 
and 
r-y=2, (5) 
The equation (1) becomes 
ay 
— =Aoz - aw. 6 
c 0 y (6) 


The quantity 2 considered as a function of w is represented by 
a curve of the same type as that shown in Figure 1. The asymp- 
totic values of 2 are, however, —1 and +1. 

Instead of the function 2 (yw), consider the inverse function wv (2). 
(Figure 2.) For 2=-—1, we have ~=-—~, di/dz2=+0. Fora = 
+1, we have ~ =+0; dW/dz=+0. For 2 =0, we have w =0, 
dy/dz > 0. 

Denote 

dy 1 
aes an AE 8 7 
fe 71 Ty He) (7) 
We have: 
w(-1)=- 0; ¥(0)=0; W(+1) =+ &; 
f(-1) = f(+1) = + 95 u(-1) = u(+1) = 0; (8) 
u(z)>0 for -1<2<+1. 
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¥ 
={ leer 
FIGURE 2 
Introducing (7) into (6) we obtain 
e = Ap u(2) 2 — au(z) (2). (9) 


Consider the first term on the right side of (9). Because of (8) 
this term is zero for z = 0 and z = +1, and is positive in between. 
Hence it must have at least one maximum between z = 0 and 2 = 
+1. We shall first consider the case where there is only one maxi- 
mum. We shall subsequently see that this is the only possibility 
in the case under consideration. 

The first term of the right side of (9) is therefore represented for 
0<2<+1 by a curve similar to the full line in Figure 3. The 


A, u(Z) 


——— dat 


FIGURE 3 


408 N. RASHEVSKY 


curve may be or may not be tangent to the 2-axis at 2 = 1; it also 
may or may not have inflection points. 

For 0 <2 <1,the second term of the right side of (9) is negative 
because of (8). We have always w(+1)u(+1)=90. Also because 
of (8) the term is zero for 2 = 0 and for z = 1. Hence it has at least 
one minimum between z =0 andz=1. At 2 =1, the derivative of 
the first term with respect to 2 is 


Aou’ (1) <0, (10) 
while the derivative of the second term is 
-alu(1) ¥’(1) + w(1)u’ (1)]. (11) 
Because of (7) expression (11) is equal to 
-alleu (je GO) (12) 


If u’(1) = 0, then, even if w(1)u’(1) = 0, the absolute value of 
expression (12) is always greater than that of (10), for any Ao and 
a. If u’(1)< 0 but finite, then, since (1) = ~, we still have the 
same situation. Even if u’(1)=—«, the absolute value of (12) 
goes to infinity more rapidly than the absolute value of (10). Hence 
always 


| Aow’(1)| < | afi + vv’ (1)]]. (13) 

Therefore the right side of (9) is always negative in the neigh- 
borhood of 2 = 1. 

The first derivative of the first term on the right side of (9) 

is Ao [u(z) + 2u’(z)] and is finite at 2 = 0 because of (8). Hence, 

by taking a sufficiently small a@ we can always ensure that at 


z2=0 the absolute values of the first derivative of the second 
term of the right side of (9) will be less than the absolute value 


of the first derivative of the first term. In that case, in the 
neighborhood of 2 = 0, dz/dt will be positive. As we have seen, 
however, it is always negative in the neighborhood of 2 =1. 
Hence in the interval 0 < 2 <1, dz/dt will have at least one posi- 
tive maximum, then become zero, reach at least one negative mini- 
mum, and again become zero for 2 = 1. See Figure 3, broken line. 

Because of the symmetry of the &(z) curve with respect to the 
point 2 = 0, we find for -1 < 2 < 0 the curve shown in Figure 3 for 
Beis 

This situation corresponds to the case shown in Figure 1, where 
the straight line aw intersects the S-shaped curve at three points. 
As we have discussed in loc. cit., the point O corresponds to an 
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unstable equilibrium, while 0; and Q2 correspond to stable equili- 
bria. Figure 3 shows the same situation. 

One thing which appears puzzling at first is the fact that ac- 
cording to Figure 3, dz/dt = 0 also at 2 =-1 and z =+1, while 
according to Figure 1, dw/dt, which is given by the difference of 
ordinates of the straight line and of the curve, tends toward + on 
the positive side and toward —~ on the negative side. However, 
while for Wy =+e or 2=+1, dW/dt =+40, the value of dz/dt is 
zero. The reason for this lies in the fact that when w = +0 the 
distribution curve No(¢) is shifted so far to the right that even 
a large change in w does not affect 2. 

Now consider the situation where the distribution function N (¢) 
is displaced toward negative ¢’s. This has the same effect as a 
shift of the straight line in Figure 1 to the left, in other words as a 
change of the last term on the right side of (1) from aw toayw + 4, 
where 6 is a positive constant. Hence, instead of (9) we obtain: 


3 = Au(z)2 — au(z) w (2) - bu(a2). (14) 
The graph of dz/dt against z is now changed by adding to the 
curve —bu(z). That curve is shown by the dotted line in Figure 3. 
If f(z) is analytic everywhere, then f’(0) = 0, because f(z) has 

a minimum at z = 0, Since 


u(z)=- , <0, (15) 


we see, because of (9), that w(z) has a horizontal tangent at z = 0 
and at 2 = +1, as shown in Figure 3. The value of dz/dt as a func- 
tion of 2 in the range 0 < 2 <1 is given graphically by the sum of 
the broken and the dotted lines of Figure 3. If 2; is the abscissa 
of the maximum of the broken line, then it is readily seen that the 
addition of the dotted line shifts the maximum to the right because 
now at 2 = 2;, the derivative of the sum is positive. At the same 
time the value of the maximum decreases. For a sufficiently large 
b the maximum value will become zero, as shown on the right side 
of Figure 4, where the curve represents dz/d¢ as a function of 2 
for a sufficiently large 0. 

By a similar argument we see that the broken line for -1< 2 < 0 
will be so changed by the addition of the dotted line that its mini- 
mum will be shifted to the right. Since for 2 = -1, u’(z) = 0, there- 
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fore dz/dt will remain positive in the neighborhood of 2 =~1 for 
any value of b, as shown in Figure 4. 

The situation represented by the broken line in Figure 3 corre- 
sponds to the one represented by the full straight line in Figure 1. 
There are two stable configurations, 0; and O2, and one unstable, 
O. The situation represented in Figure 4 corresponds to the one 
represented by the dotted line in Figure 1. Only point 02 is now 
stable. If for 5 = 0 the social configuration was represented by 
0,1, then, when the critical value of 6 is reached, a change to con- 
figuration O2 will occur. 

For 5 = 0, when there are two stable configurations, the absolute 
value | zi| of the abscissa of the maximum of the broken line in 


FIGURE 4 


Figure 3, that is, of the maximum of dz/dt, for 0 < 2 <1, is equal 
to the absolute value | 22| of the abscissa of the minimum of dz/dt, 
for -1<2<0. But for 6>0 both the maximum and the minimum 
are shifted to the right. Hence now the new abscissae are such 
that (Figure 4): 
l2il > leat. (16) 
But the value of dz/dt at 2 = 24 is the highest rate of change 
from one configuration to another during the process of transition 
from 0; to Og. Hence we have the following theorem: 


During a transition from one configuration to another caused by 
the first configuration becoming unstable, the fraction of individ- 


IMITATIVE BEHAVIOR 411 


uals still exhibiting behavior R; at the time when the rate of change 
is the greatest is less than one-half but greater than the fraction 
of individuals which exhibited behavior Rz at the moment when in- 
stability set in. 

If we consider the whole conceptual scheme as a crude mathe- 
matical model of a revolutionary change, we may restate the above 
theorem thus: 

When revolution reaches its crest, the fraction of individuals 
who joined the revolution is greater than one-half but less than 
was the number of antirevolutionaries at the moment of the be- 
ginning of the revolution. 

In principle this statement may be verified by observation. 

We must now discuss the possibility of more than one maximum 
of the curve shown by the full line on the right of Figure 3, as 
well as the possibility of more than one zero point for the broken 
line for 0<2<1. If this will occur, then we would have more 
than two stable equilibria. Such a situation has been studied by 
H. D. Landahl (1950, 1953) but it corresponds in Figure 1 to a 
curve the derivative of which is not monotonic. Inasmuch as Fig- 
ures 3 and 4 represent the same physical situation as Figure 1, 
more than one zero-point for (dz/dt),-9 cannot occur for 0<2<1, 
nor can zu(z) have more than one maximum. 


This work was aided by a grant from the Dr. Wallace C. and 
Clara A. Abbott Memorial Fund of the University of Chicago. 
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In certain situations like the aftermath of a revolution when discon- 
tent rises amongst certain groups of the population, it is frequently ob- 
served that the discontented groups are firmly convinced that their point 
of view is shared by the majority of the population. Yet future events 
prove that this is far from being the case. This effect is partly attribut- 
able to ‘‘wishful thinking,’’ partly to a purely social mechanism. The 
wishful-thinking effect may be considered as a case of psycho-physical 
discrimination in which a bias is introduced proportional to the degree of 
satisfaction anticipated from a given situation. H. D. Landahl’s well- 
known equations can be applied to this case. The social factor is based 
on the circumstance that an individual associates by preference with 
such other individuals as have similar opinions. This results in an 
actual error of estimation of the relative minority or majority because of 
different frequencies of contact with individuals of the two opposing 
groups. Both factors may be combined into one equation. 


Some quite old experiments made on small, isolated college com- 
munities (Sherif, 1936) have shown that representatives of a very 
small minority have a tendency not only to underestimate their 
minority but actually to believe that they belong to a majority. 
This situation is particularly pronounced in various emigrant groups 
which have left their native countries as a result of some social 
upheaval. To listen to the hundreds of thousands of the White 
Russian emigres some 25 or 30 years ago, and sometimes even 
now, the majority of the population in the USSR is definitely op- 
posed to the Communist regime. The regime is represented as 
tottering on the brink of collapse. Yet, in spite of all such pre- 
dictions of collapse, it is now quite evident to any unbiased ob- 
server that it is as stable as can be. It has survived revolts, 
interventions, and the German invasion. 
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Similarly, in the opinion of the anti-Castro Cuban emigrants, the 
majority of the Cuban population is against Castro. Taking this 
assertion much too literally has cost the United States the head- 
ache of the Cuban invasion debacle. 

It is a legitimate problem of mathematical bio-sociology to study 
the possible mechanisms of the situation discussed above. The 
simplest explanation is that of ‘‘wishful thinking.’’ How can 
‘‘wishful thinking’? be interpreted in terms of the mathematical 
biology of the central nervous system? 

The evaluation of whether one or another social group represents 
the majority is a particular case of psycho-physical discrimination 
(Rashevsky, 1959, chap. ii, iii). Directly or indirectly, an indi- 
vidual observes or estimates the relative number of persons in two 
different groups and makes a judgment as to which is larger. We 
can apply to this situation Landahl’s well-known theory (Rashevsky, 
1959, chap. ii). The stimuli which are to be compared are now 
directly related to the numbers N; and Ne of the individuals in the 
two social groups. Therefore, the corresponding central excita- 
tions €; and €2 are also directly related to N; and No. Thus 


Nyce Fea ey ees 


(1) 


Ny > Mg Dbeq>tess 
For sake of definiteness, let N; > Ng. Consider an individual / 
who belongs to the minority group of N2 persons. Consider here, 
for simplicity, the case of zero thresholds. The probability of this 


individual making the wrong estimate and believing that actually 
Ni < Nog is given by (Rashevsky, 1959, chap. ii, eq. (7)): 


-C&\— 2) 

Pox [  ” ptayaa, (2) 
where p(A) is the distribution function of the central fluctuations 
which cause erroneous judgments. The quantity P,, represents the 
probability that a ‘‘minority individual’? will feel that he belongs 
oe to the majority. When €; = €, that is Nj =No, then P, = 
ax (FOr 6{ 2 So. hyp & and decreases rapidly though asymptoti- 
cally to zero as the difference €; — €g increases. 

However, let the situation which corresponds to N; < Ne, or ac- 
cording to (1), €1 < €2, be connected with a positive amount S, of 
satisfaction for the individual J (Rashevsky, 1959, chap. v). In 
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that case the anticipation of the situation NM; < No, or & < a, will 
result in an increase of excitation €» by an amount &S;, where 
& <1 is a constant which becomes equal to 1 only when the situa- 
tion N; < N¢ is not merely anticipated but actually occurs. Hence 
the probability P,, is now given by 


, —( &,—&,—0 5) 
Py = i p(A)dA,. (3) 


For the same value of &; — &2, PZ, > P,. Hence in this case the 
individual / will tend to underestimate the majority N;. The larger 
S;, the greater the probability of such an underestimation. If o de- 
notes the standard deviation of p(A) and if 


a S;-— €; + Eg >>9, (4) 


then R; will be close to unity. 

The mechanism of ‘‘wrong judgment’’ is based in this case on 
the inherent occurrence of wrong judgments. The frequency of this 
inherent occurrence is enhanced by S;. Otherwise we assume that 
the actual observations of the numbers \; and Np are correct. 

There is, however, a purely social factor which may result in a 
wrong evaluation of N; and Neg. Let 


N=N,+No (5) 


be the total population. Assume, for simplicity, a uniform social 
distribution of the individuals of both groups. The individual / in 
his daily work has to come into contact with a certain number n of 
other individuals. This number is roughly proportional to N. If a 
denotes a coefficient, then 


n= aN, (6) 
Consider the case of sufficiently large n. Then the individual / 
will come into contact on the average with 


N 
tes nm = aN, (7) 
N 


individuals of the majority group, and with aN individuals of his 
own minority group. This could give him a fair estimate of the 
ratio N;/Nz. However, every individual has the tendency to prefer 
to associate with individuals who share his views, if these views 
are strong and important. Outside his working hours the individual 
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[ will associate only with individuals of the minority group. These 
may come either from the number @N2 or from other individuals with 
whom / does not necessarily come into contact in his daily work. 
Let the total number of individuals with whom / associates be nj. 
We have 

nj > aNo. (8) 


Even if ny = @No, the actual number of contacts which / makes 
with individuals of the minority group per unit time is greater than 
aNz, because I meets the same individuals outside his working 
hours while he avoids the individuals of the majority group. Thus 
if nf and nj’ denote correspondingly the number of contacts which / 
makes with the individuals of the two groups, it is possible that 

ny > nj (9) 
even though N; > Ne. 

If each contact acts as a stimulus in the evaluation of N; and N2 
correspondingly, the individual / will, if (9) holds, make the wrong 
estimate, namely that N < No. 

We may consider each contact as being on the average of a dura- 
tion T,-1 and T,2 correspondingly. Denote the interval between 
contacts by T;2 and T;1 correspondingly. We have 


‘Toe Care ts RS SO Shc (10) 


The series of contacts may be considered as a series of inter- 
mittent stimuli, and the average excitation €; and &2 due to them 
may be obtained by using a previous expression (Rashevsky, 1960, 
chap. ix, eq. (22)). Those values for €: and €2 are then introduced 
into (3). Thus Pj becomes dependent not only on the biological 
parameters contained in p(A) and in S; but also on the purely so- 
cial parameter contained in T.1, Te2, Tyi and T;2. 


This work has been aided by a grant from the Wallace C. and 
Clara A. Abbott Memorial Fund of the University of Chicago. 
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_ In the paper ‘‘Abstract Mathematical Molecular Biology’’ by N. 
-Rashevsky, Vol. 23, pp. 237-260, the fourth line from the top of 
page 253 should read: 

9, in other words the reaction )—>8, while Dg catalyzes C—>V. 
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ANNOUNCEMENT 


POST- AND PREDOCTORAL 
FELLOWSHIPS IN MATHEMATICAL BIOLOGY 


Under a training grant from the United States Public Health Serv- 
ice, a number of Post- and Predoctoral Fellowships are now avail- 
able in the Committee on Mathematical Biology of The University 
of Chicago. Stipends are provided at various levels, depending 
on the previous education and experience of the trainee, and in- 
clude allowances for dependents. The tuition fees are also paid 
to the Predoctoral Fellows. All Fellowships are exempt from fed- 
eral income tax. 

The Committee on Mathematical Biology provides office space 
and all training and research facilities for the Fellows. 

The Postdoctoral Fellowships offer a unique opportunity to in- 
dividuals holding a Ph.D. degree in other sciences to receive 
training in Mathematical Biology and to do research in that field. 

Those interested should send their applications to Professor N. 
Rashevsky, Chairman, Committee on Mathematical Biology, The 
University of Chicago, 5741 Drexel Avenue, Chicago 37, Illinois, 
and include the following information: 


1. Curriculum vitae. 

2. Number of dependents. 

3. The particular branch of Mathematical Biology in 
which the applicant is interested. 

4, Scholastic record. 

5. List of publications, if any. 

6. Three references. 

7. Date at which the applicant desires to begin his work. 


419 


~ mal? 


tly) waly ve ‘hyn? Oy 8 gee RR? gery” 
aS aie ae serene is 
nm <4 rl wren wal fe é ork 


fae at 09 our ee ioe songs ran ES 


aa 


ANNOUNCEMENTS 


SYMPOSIUM ON REDUNDANCY TECHNIQUES FOR 
COMPUTING SYSTEMS 


A symposium on Redundancy Techniques for Computing Systems will 
be held on February 6 and 7, 1962, sponsored by the Information Systems 
Branch, Office of Naval Research. This Symposium will be held in the 
Department of Interior Auditorium, C Street between 18th and 19th Street, 
N. W., Washington, D. C. 

The objective of the Symposium is to focus attention toward new 
ideas, research and developments which may lead to the introduction of 
redundancy techniques into forthcoming computing systems. It is apparent 
that many information processing systems of the future will be of such 
size that conventional methods of fabrication and of maintenance will be 
completely infeasible. Computers for space applications have reliability 
requirements which are presently not achievable without use of redun- 
dancy. Thus, some form of logical redundancy must be introduced if 
practical systems are to be operated. Accordingly, it is planned to pre- 
sent in a single symposium a collection of related papers concerning 
suggested new techniques for the use of redundancy in computing sys- 
tems. This should permit relative evaluation of such techniques with 
respect to their probable future utilization in various aspects of,the com- 
puting field. The program will consist of papers invited from many of the 
organizations engaged in appropriate research and development activities. 

Attendance is open to all interested technical personnel. Further in- 
formation and a preliminary Symposium program, when available, may be 
obtained by contacting: Miss Josephine Leno, Code 430A, Office of 
Naval Research, Washington 25, D. C. 


SYMPOSIUM ON OPTICAL CHARACTER RECOGNITION 


A Symposium on Optical Character Recognition will be held on Janu- 
ary 15, 16, and 17, 1962. It will be jointly sponsored by the Information 
Systems Branch of the Office of Naval Research and the Research In- 
formation Center of the National Bureau of Standards. The Symposium 
will be held in the Department of the Interior Auditorium on C Street, be- 
tween 18th and 19th Streets, N.W., Washington, D. C. 
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The objective of the Symposium will be to bring together equipment 
specialists, scientists, and potential users who are interested in ex- 
ploiting automatic character recognition techniques for a variety of appli- 
cations. Major purposes are to explore, for mutual benefit, the present 
state of the art, operational requirements and the implications of current 
research. It is planned that the meeting will be composed of several 
sessions to cover, respectively, presentations on available equipment, 
directions in current research, representative requirements of potential 
users, and a forecast of future progress. The program for these sessions 
will consist of invited papers from individuals and organizations active 
in research and development efforts in the field, and from potential users. 
Where feasible, demonstrations of optical character recognition systems 
in operation in the Washington area will be arranged. 

Attendance is open to all interested technical and management per- 
sonnel. Further information and a preliminary Symposium program, when 
available, may be obtained by contacting: Miss Josephine Leno, Code 
430A, Office of Naval Research, Washington 25, D. C. 
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